Nucleic Acid Molecules Encoding Polypeptides Having The Enzymatic 
Activity Of An RNA-Directed RNA Polymerase (RdRP)



The present invention relates to nucleic acid molecules encoding
polypeptides having the enzymatic activity of an RNA-directed RNA
polymerase (RdRP). The present invention also provides vectors comprising
said nucleic acid molecules, wherein the nucleic acid molecules are
operatively linked to regulatory elements allowing expression in prokaryotic
and/or eukaryotic host cells. Additionally, the present invention relates to
polypeptides encoded by said nucleic acid molecules and methods for the
production of said polypeptides. The present invention further relates to
pharmaceutical and diagnostic compositions as well as kits comprising the
aforementioned nucleic acid molecules and/or comprising a nucleic acid
molecule which is complementary to such a nucleic acid molecule. Said
compositions and kits may further comprise polypeptides encoded by the
described nucleic acid molecules. Furthermore, the present invention relates
to antagonists and inhibitors of the aforesaid polypeptides and/or antibodies
specifically recognizing such polypeptides. Also within the scope of the
present invention are methods and uses comprising the nucleic acid
molecules, vectors, polypeptides, antibodies and antagonists and inhibitors of
the invention for modulating gene expression in humans and animals.
Furthermore, the present invention relates to transgenic plant cells and plants
containing the aforementioned nucleic acid molecules as well as the use of
the aforementioned nucleic acid molecules, polypeptides and/or
antagonists/inhibitors in plant cell culture and plant tissue culture and/or plant
breeding.

Background of the Invention

Since the development of plant transformation systems, there has been
particular interest in the stable expression of foreign genes in higher plants.
Although these attempts were successfully realized for several herbicide
resistance genes, a series of experiments failed that had been designed to use
plants as efficient bio-reactors. It seemed that higher plants have a system of
defense to protect themselves against overexpression of foreign genes.
Whenever transgene expression reaches a threshold dose, it is silenced, in the
absence of selective pressure, either by transcriptional or
post-transcriptional inactivation.
Recent studies on these phenomena indicated that "antisense" RNA might be
responsible for these silencing mechanisms. The idea of an "antisense"
RNA-mediated gene silencing was substantiated by the analysis of
transgenically mediated virus resistance in plants (Smith et al., 1994;
English et al., 1996; Sijen et al., 1996). It could be shown that the in vivo
transcripts of non-translatable "sense" cDNA constructs which had been
integrated into the plant genome can mediate resistance against infection
with plant viruses.
Further analysis revealed that this resistance, which was due to
post-transcriptional gene silencing (PTGS), only occurred when the transgene
shared homology with the infecting virus. Based on these data and with
regard to previous results describing the co-suppression phenomena in plants
(for review see Meyer, 1996), several models explaining the PTGS
mechanism have been introduced (English et al., 1996; Baulcombe, 1996;
Sijen et al., 1998). Common to all these models is the assumption that
"antisense" RNAs are synthesized from "sense" RNA templates by an
RNA-directed RNA polymerase (RdRP). Subsequently, the produced "antisense"
RNAs can hybridize to complementary parts of mRNAs or invading virus
RNAs, which would inevitably lead to degradation of the double-stranded
regions.

It is most likely that this PTGS is not restricted to transgenes but that it is also
involved in normal plant gene regulation. Moreover, it is most likely that the
same mechanism takes place in all higher eukaryotes. Therefore, a detailed
examination of RNA-mediated gene regulation has become an important
aspect of basic and applied research. An RNA-dependent RNA polymerase
(EC 2.7.7.48, RdRP) activity has been detected in healthy plant tissue; for
review see Fraenkel-Conrat (1986). Previous studies of RdRP activity have
suffered from the inhomogeneity of enzyme preparations and of the resulting
RNA products in that they did not allow the precise determination of their
template complementarity by direct RNA sequencing. Schiebel et al. (1993a,
b) provided evidence for the concept that RdRP-mediated transcription is
a truly RNA-instructed process yielding products that are precise
complementary copies of the RNA template offered to an RdRP-active
enzyme preparation. However, although purification of the RdRP from tomato
leaf tissue to electrophoretic homogeneity was reported (Schiebel et al.,
1993a, b), the enzyme preparations were not amenable to amino acid
sequencing. One reason for the slow progress in elucidating the amino acid
sequence of the RdRP is that only low amounts of protein could be isolated
using the hitherto available technology and/or that the enzyme preparations
obtainable with standard techniques contained further proteins, which
resulted in non-informative or even false sequence information. However, for
efficient use of RdRP-associated technology, it was desirable to be able to
manipulate the genetic material associated with said technology.

Thus, the technical problem underlying the present invention is to provide 
such material.

Description of the Invention

The solution to the technical problem is achieved by providing the 
embodiments characterized in the claims. 

Accordingly, the invention relates to a nucleic acid molecule encoding a 
polypeptide having the enzymatic activity of an RNA-directed RNA 
polymerase (RdRP) or encoding an enzymatically active fragment thereof
selected from the group consisting of: 

(a) nucleic acid molecules encoding a polypeptide comprising the amino 
acid sequence given in SEQ ID NO:2; 

(b) nucleic acid molecules comprising the nucleotide sequence given in 
SEQ ID NO:1; 

(c) nucleic acid molecules hybridizing with a complementary strand of a 
nucleic acid molecule as defined in (a) or (b); and 

(d) nucleic acid molecules, the nucleotide sequence of which is 
degenerate as a result of the genetic code to a nucleotide sequence of 
a nucleic acid molecule as defined in (c). 

In accordance with the present invention, a nucleic acid molecule encoding a 
novel class of RdRPs has been identified. This has been achieved by using a 
novel purification method for the isolation of a polypeptide having the 
enzymatic activity of an RdRP which was suitable for amino acid sequencing.
Oligonucleotides were designed and used for cloning of the corresponding
cDNA.

The nucleic acid sequence of SEQ ID NO: 1 from nucleotide 194 to
nucleotide 3535 encodes a polypeptide of 1114 amino
acids with a calculated molecular weight of about 127 kDa. These data are in
good agreement with the prior art experimental values that had been
determined by SDS-PAGE (128 kD) and sucrose gradient centrifugation (119
kD) (Schiebel et al., 1993a). The nucleotide and the amino acid sequences
given in SEQ ID NOS: 1 and 2, respectively, displayed no significant
homologies to sequences in any database searched (BLASTX, GAP, Wisconsin
Sequence-Packaging System, Version 7.0). In addition, the
presence of putative phosphorylation sites for putative protein kinases can be
identified by computer-aided search for phosphorylation consensus
sequences, which suggests possible regulation by phosphorylation. Studies
which had been carried out in accordance with the present invention revealed
that the polypeptide encoded by the nucleotide sequence of SEQ ID NO:1 is
identical with the RdRP isolated from tomato plants as described in Example 1
of the present application.
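
The coding-region arithmetic stated above can be checked with a short sketch. The coordinates 194 and 3535 and the figure of 1114 come from the text; the function name is hypothetical, and because the text does not say whether the end coordinate includes a stop codon, the sketch only counts codons in the stated range.

```python
def codon_count(start: int, end: int) -> int:
    """Count complete codons in a 1-based, inclusive nucleotide range."""
    length = end - start + 1  # inclusive coordinates
    if length % 3 != 0:
        raise ValueError("range is not a whole number of codons")
    return length // 3


# Coordinates taken from the description of SEQ ID NO: 1.
print(codon_count(194, 3535))
```

The range spans 3342 nucleotides, that is 1114 codons, which matches the stated polypeptide length.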

The polypeptide encoded by the nucleotide sequence of SEQ ID NO:1
displays the ability to catalyze in vitro transcription of preferably short
single-stranded RNA and DNA molecules into precisely complementary RNA copies
up to the full length of these templates. The RdRP-directed transcription can
be primed by RNA and DNA oligonucleotides complementary to the
3'-terminal nucleotides of the template. In addition, the RdRP encoded by the
nucleic acid molecule of the invention can catalyze unprimed transcription.
An unprimed transcription starts preferentially at the 3'-terminal nucleotides of
a corresponding template. Furthermore, the RdRP is capable of adding a
single noncomplementary nucleotide to the 3' terminus; see also Schiebel et
al. (1993b).

The term "RNA-directed RNA polymerase" (RdRP), as used herein, means
that said polypeptide or enzymatically active fragment thereof is capable of
RNA-directed RNA synthesis, thus using RNA as a template for synthesizing
complementary RNA molecules. However, as set forth above, said RdRP may
also be capable of accepting single-stranded DNA molecules as templates for
RNA transcription.
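
What a "precisely complementary RNA copy" of a single-stranded RNA or DNA template looks like can be illustrated with a minimal sketch. It models only Watson-Crick base pairing over the full template length; the function name is hypothetical, and enzymatic details such as priming or the addition of a single noncomplementary 3' nucleotide are deliberately not modelled.

```python
# Base-pairing partners in the RNA product; "T" covers DNA templates.
RNA_PARTNER = {"A": "U", "U": "A", "T": "A", "G": "C", "C": "G"}


def complementary_rna(template: str) -> str:
    """Return the full-length complementary RNA, written 5'->3'.

    The template (RNA or DNA) is expected in 5'->3' orientation.
    """
    template = template.upper()
    # Synthesis starts at the template's 3' end, so read it in reverse.
    return "".join(RNA_PARTNER[base] for base in reversed(template))


print(complementary_rna("AUGC"))  # RNA template
print(complementary_rna("ATGC"))  # DNA template
```

Both calls yield the same RNA product, reflecting that the complementary copy is RNA regardless of whether the template is RNA or DNA.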

The availability of the nucleic acid molecules encoding the RdRP is definitely 
a major advantage because now experiments are possible that allow both the
"knock-out" and the improvement of the regulation mechanism for a given
gene. 

The nucleotide sequence depicted in SEQ ID NO: 1, in general, encodes a
novel class of polypeptides having RdRP activity. By the provision of this 
nucleotide sequence, it is now possible to isolate identical or homologous 
nucleic acid molecules which encode polypeptides with the enzymatic activity 
of said polypeptide from other species or organisms. 

Thus, the invention also relates to nucleic acid molecules which hybridize with
the above-described nucleic acid molecules and differ in one or more positions
in comparison with these, as long as they encode a polypeptide having RdRP
activity. Such molecules comprise those which are altered, for example, by
deletion(s), insertion(s), substitution(s), addition(s) and/or recombination(s) or
any other modification(s) known in the art, either alone or in combination, in
comparison to the above-described nucleic acid molecules. Methods for
introducing such modifications in the nucleic acid molecules according to the
invention are well known to the person skilled in the art. The invention also
relates to nucleic acid molecules the sequence of which differs from the
nucleotide sequence of any of the above-described nucleic acid molecules
due to the degeneracy of the genetic code.
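
Degeneracy of the genetic code can be illustrated with a minimal sketch: two different nucleotide sequences that encode the same peptide. The codon assignments shown are a small excerpt of the standard genetic code; the two example sequences and the function name are hypothetical and are not taken from SEQ ID NO: 1.

```python
# Excerpt of the standard genetic code (RNA codons), enough for the example.
CODON_TABLE = {
    "AUG": "M",
    "UUU": "F", "UUC": "F",
    "GGU": "G", "GGC": "G", "GGA": "G", "GGG": "G",
    "AAA": "K", "AAG": "K",
}


def translate(rna: str) -> str:
    """Translate an RNA sequence codon by codon using the excerpt above."""
    if len(rna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[rna[i:i + 3]] for i in range(0, len(rna), 3))


seq_1 = "AUGUUUGGUAAA"  # hypothetical sequence
seq_2 = "AUGUUCGGGAAG"  # differs at three third-codon positions
print(translate(seq_1), translate(seq_2))
```

The two sequences differ in three positions yet encode the same peptide, which is the sense in which a sequence is "degenerate" with respect to another.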

By "hybridizing" it is meant that such nucleic acid molecules hybridize under 
conventional hybridization conditions, preferably under stringent conditions 
such as described by, e.g., Sambrook et al. (Molecular Cloning: A Laboratory
Manual, 2nd Edition, Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, NY (1989)).

In a preferred embodiment the nucleic acid molecules according to the 
invention are RNA or DNA molecules, and most preferably cDNA or genomic 
DNA. It should also be understood that the nucleic acid molecules of the
invention may be synthetically produced DNA or RNA molecules or hybrids
thereof. Preferably, said nucleic acid molecules may be prepared from nucleic
acids obtained from any organism, tissue or cell, namely from bacteria,
eukaryotes or archaebacteria, and preferably from mammalian, plant, insect,
bacterial or fungal cells, worm cells such as those derived from C. elegans, or
viruses. In a further most preferred embodiment the nucleic acid molecule of
the invention is derived from a plant, preferably from tomato.

Nucleic acid molecules hybridizing with the above-described nucleic acid
molecules can, in general, be derived from any organism comprising such
molecules, preferably from monocotyledonous or dicotyledonous plants, in
particular from any plant of interest in agriculture, horticulture or wood culture,
such as crop plants, namely those of the family Poaceae, any other
starch-producing plants, such as potato, manioc, leguminous plants,
oil-producing plants, such as oilseed rape, linseed, etc., plants using
polypeptide as storage substance, such as soybean, plants using sucrose as
storage substance, such as sugar beet or sugar cane, trees, ornamental plants etc.
Preferably the nucleic acid molecules according to the invention are derived
from plants belonging to the family Solanaceae. Identical or similar
polypeptides may also be found in animals, preferably mammals. Nucleic acid
molecules hybridizing to the above-described nucleic acid molecules can be
isolated, e.g., from libraries, such as cDNA or genomic libraries, by techniques
well known in the art. For example, hybridizing nucleic acid molecules can be
identified and isolated by using the above-described nucleic acid molecules or
fragments thereof or complementary sequences thereof as probes to screen
libraries according to standard techniques. It is also possible to isolate
such nucleic acid molecules by applying the polymerase chain reaction (PCR)
using as primers oligonucleotides derived from the above-described nucleic
acid molecules.

Nucleic acid molecules which hybridize with any of the aforementioned
nucleic acid molecules also include fragments, derivatives and allelic variants
of the above-described nucleic acid molecules that encode a polypeptide
having RdRP activity or an enzymatically active fragment thereof. Fragments
are understood to be parts of nucleic acid molecules long enough to encode
the described polypeptide or an enzymatically active fragment thereof. The
term "derivative" means in this context that the nucleotide sequence of these
nucleic acid molecules differs from the sequences of the above-described
nucleic acid molecules in one or more nucleotide positions and is homologous
to said nucleic acid molecules. Homology is understood to refer to a
sequence identity of at least 40 %, particularly an identity of at least 60 %,
preferably more than 80 % and still more preferably more than 90 %. The
deviations from the sequences of the nucleic acid molecules described above
can, for example, be the result of substitution(s), deletion(s), addition(s),
insertion(s) and/or recombination(s), either alone or in combination.
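
The identity thresholds named above can be made concrete with a small sketch. It assumes two sequences that have already been aligned to equal length, with "-" marking gaps, and it divides matches by the alignment length; both the alignment step and that choice of denominator are assumptions, since the text does not prescribe how identity is computed. The function names and example sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned, equal-length sequences.

    Gap characters ("-") never count as matches.
    """
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    matches = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return 100.0 * matches / len(aligned_a)


def homology_tier(identity: float) -> str:
    """Map an identity value to the tiers named in the description."""
    if identity > 90:
        return "more than 90 %"
    if identity > 80:
        return "more than 80 %"
    if identity >= 60:
        return "at least 60 %"
    if identity >= 40:
        return "at least 40 %"
    return "below 40 %"


identity = percent_identity("ACGUACGUAC", "ACGUACGUAA")  # one mismatch in ten
print(identity, homology_tier(identity))
```

A value of exactly 90 % falls in the "more than 80 %" tier, because the two highest tiers are worded as strict thresholds.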

Homology further means that the respective nucleic acid molecules or
encoded polypeptides are enzymatically and/or structurally equivalent. The
nucleic acid molecules that are homologous to the nucleic acid molecules
described above and that are derivatives of said nucleic acid molecules are,
for example, variations of said nucleic acid molecules which
represent modifications having the same enzymatic function. They may be
naturally occurring variants, such as sequences from other plant varieties or
species, or obtained by mutation. These mutations may occur naturally or
may be obtained by mutagenesis techniques. The allelic variants may be
naturally occurring allelic variants as well as synthetically produced or
genetically engineered variants.

The polypeptides encoded by the various derivatives and variants of the 
above-described nucleic acid molecules share one or several specific 
common characteristics, such as enzymatic activity, molecular weight, 
immunological reactivity, conformation, etc., as well as physical properties,
such as electrophoretic mobility, chromatographic behavior, sedimentation
coefficients, pH optimum, temperature optimum, stability, solubility,
spectroscopic properties, etc.

All such fragments, variants and derivatives of the RdRP of the invention are
included within the scope of this invention, as long as the essential
characteristic RdRP activity as defined above remains unaffected in kind.

In a further embodiment, the invention relates to nucleic acid molecules of at
least 15 nucleotides in length hybridizing specifically with a nucleic acid
molecule as described above or with a complementary strand thereof. This
means that they hybridize, preferably under stringent conditions, specifically
with the nucleic acid molecules as described above and show no or very little
cross-hybridization with nucleic acid molecules encoding other polypeptides.
Such nucleic acid molecules may be used as probes and/or for the control of
gene expression. Nucleic acid probe technology is well known to those skilled
in the art, who will readily appreciate that such probes may vary in length. The
nucleic acid probes of the invention are useful for various applications. On the
one hand, they may be used as PCR primers for amplification of nucleic acid
molecules according to the invention. On the other hand, they can be useful
tools for the detection of the expression of nucleic acid molecules according
to the invention in plants or any other organism, for example mammals, by,
e.g., in situ hybridization or Northern blot hybridization. Another application is
the use as a hybridization probe to identify nucleic acid molecules hybridizing
to the nucleic acid molecules according to the invention by homology
screening of genomic or cDNA libraries. Nucleic acid molecules according to
this preferred embodiment of the invention which are complementary to a
nucleic acid molecule as described above may also be used for repression of
expression of a nucleic acid molecule encoding a polypeptide having RdRP
activity, for example due to an antisense effect or for the construction of
appropriate ribozymes which specifically cleave such nucleic acid molecules.
Furthermore, the person skilled in the art is well aware that it is also possible
to label such a nucleic acid probe with an appropriate marker for specific
applications, such as for the detection of the presence of a nucleic acid
molecule of the invention in a sample derived from an organism.

The present invention also relates to vectors, particularly plasmids, cosmids,
viruses and bacteriophages used conventionally in genetic engineering, that
contain a nucleic acid molecule according to the invention.

In a preferred embodiment the nucleic acid molecule present in the vector is
linked to regulatory elements which allow the expression of the nucleic acid
molecule in prokaryotic and/or eukaryotic cells. The nucleic acid molecule
encoding the polypeptide having RdRP activity and the template nucleic acid
molecule may be contained in different vectors or in the same vector. Expression
comprises transcription of the nucleic acid molecule, preferably into a
translatable mRNA. Regulatory elements ensuring expression in prokaryotic
and/or eukaryotic cells are well known to those skilled in the art. In the case of
eukaryotic cells they normally comprise promoters ensuring initiation of
transcription and optionally poly-A signals ensuring termination of
transcription and stabilization of the transcript. Additional regulatory elements
may include transcriptional as well as translational enhancers.

The present invention furthermore relates to host cells comprising a vector as 
described above or a nucleic acid molecule according to the invention 
wherein the nucleic acid molecule is foreign to the host cell.

By "foreign" it is meant that the nucleic acid molecule is either heterologous
with respect to the host cell, which means that said nucleic acid molecule is
derived from a cell or organism with a different genomic background, or that it is
homologous with respect to the host cell but located in a different genomic
environment than the naturally occurring counterpart of said nucleic acid
molecule. This means that, if the nucleic acid molecule is homologous with
respect to the host cell, it is not located in its natural location in the genome of
said host cell. In this case the nucleic acid molecule may be either under the
control of its own promoter or under the control of a heterologous promoter.
The vector or nucleic acid molecule according to the invention which is
present in the host cell may either be integrated into the genome of the host
cell or it may be maintained extrachromosomally. In this respect, it is also to
be understood that the nucleic acid molecule of the invention can be used for
"gene targeting" and/or "gene replacement", for restoring a mutant gene or for
creating a mutant gene via homologous recombination.

The host cell can be any prokaryotic or eukaryotic cell, such as a bacterial, 
insect, fungal, worm, plant or animal cell. Preferred fungal cells are, for 
example, those of the genus Saccharomyces, in particular those of the 
species S. cerevisiae. 

Another object of the invention is a method for the preparation of such 
polypeptides encoded by the nucleic acid sequence of the invention which 
comprises the cultivation of host cells according to the invention which, due to 
the presence of a vector or a nucleic acid molecule according to the invention, 
are able to express such a nucleic acid molecule, under conditions which 
allow expression of the nucleic acid molecule, and recovering the
so-produced polypeptide from the culture. Depending on the specific constructs
and conditions used, the polypeptide may be recovered from the cells, from 
the culture medium or from both. For the person skilled in the art it is well 
known that it is not only possible to have as an expression product a native 
polypeptide but also to obtain the polypeptide as a fusion polypeptide or to 
add signal sequences directing the polypeptide to specific compartments of 
the host cell, ensuring secretion of the polypeptide into the culture medium, 
etc. 

The present invention furthermore relates to polypeptides encoded by the
nucleic acid molecules according to the invention or produced by the
above-described method, and to enzymatically active fragments of such
polypeptides having RdRP activity. In this context, it is also understood that
the polypeptides according to the invention may be further modified by
conventional methods known in the art. For example, such a polypeptide or
fragment thereof can be chemically modified according to standard methods.
By providing the polypeptides according to the present invention it is also
possible to determine fragments which retain enzymatic activity, namely the
capability of RNA-directed RNA synthesis as defined in this specification. This
allows the construction of chimeric proteins and polypeptides comprising at
least the part of the amino acid sequence of the polypeptide of the invention
which is crucial for RdRP activity, and other functional amino acid sequences. The
other functional amino acid sequence(s) may be either physically linked by,
e.g., chemical means to the polypeptide of the invention or to an
enzymatically active part thereof or may be fused by recombinant DNA
techniques well known in the art.

In a preferred embodiment the above-described polypeptide or fragment is
phosphorylated. As set forth above, the amino acid sequence of the
polypeptide of the invention may be phosphorylated due to the presence of
possible phosphorylation sites. Thus, it may be possible to regulate the
function of said polypeptide by phosphorylation. Phosphorylation of the
polypeptide of the invention may be effected in vivo in the host which
expresses said polypeptide or in vitro by, for example, bringing an appropriate
protein kinase in contact with said polypeptide or fragment.

Furthermore, the present invention relates to antibodies specifically 
recognizing polypeptides according to the invention or parts, i.e. specific
fragments or epitopes, of such polypeptides. Specific epitopes or fragments
may, for example, comprise amino acid sequences which constitute domains
which are characteristic for the polypeptides according to the invention, such
as described in the appended examples.

These antibodies can be monoclonal antibodies, polyclonal antibodies or
synthetic antibodies as well as fragments of antibodies, such as Fab, Fv, or
scFv fragments etc. These antibodies or fragments thereof can be used, for
example, for the immunoprecipitation and immunolocalization of polypeptides
according to the invention as well as for the monitoring of the synthesis of
such polypeptides, for example, in recombinant organisms, and for the
identification of polypeptides interacting with the polypeptides according to the
invention. They can further be employed in the screening of expression
libraries.

Moreover, the present invention relates to an antagonist/inhibitor of the 
polypeptide of the invention. Potential antagonists are, for example, 
oligonucleotides or antibodies which bind to a nucleic acid molecule or 
polypeptide of the invention, respectively, such that either the expression of 
the nucleic acid molecule encoding a polypeptide of the invention or the 
RdRP activity of the polypeptide of the invention is prevented. Potential 
antagonists also include antisense constructs prepared through the use of
antisense technology. Another potential antagonist is a small molecule which
binds to the catalytic or to a regulatory portion of the polypeptide of the
invention such that normal enzymatic activity is prevented. Examples of small
molecules include, but are not limited to, nucleotide analogs, small
polypeptides or peptide-like molecules or any combinations thereof. Such
antagonists may be identified by using the nucleic acid molecules, vectors
and polypeptides of the invention.

Thus, in a further embodiment the present invention relates to a method for
identifying an antagonist/inhibitor of the polypeptide of the invention
comprising:

(a) contacting the polypeptide of the invention with a plurality of
compounds to be screened;

(b) determining whether said polypeptide is still capable of RNA-directed
RNA synthesis; and

(c) identifying the compound which inhibits the RNA-directed RNA
synthesis.

Said plurality of compounds may be comprised in, for example, cell extracts
from, e.g., plants or microorganisms. Furthermore, said compounds may be
known in the art but hitherto not known to be an activator/inhibitor of the
polypeptide of the invention. In the latter case, the method would comprise the
following steps:

(a) contacting the polypeptide of the invention with a compound to be
tested;

(b) determining whether said polypeptide is still capable of RNA-directed
RNA synthesis.
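
The decision step of such a screen can be sketched as a comparison of RdRP activity with and without each compound. Everything numeric here is hypothetical: the activity values are invented placeholders rather than measurements, and the 50 % residual-activity cutoff is an assumption, since the description sets no threshold. The function and compound names are likewise illustrative only.

```python
def find_inhibitors(activity_with_compound: dict,
                    control_activity: float,
                    max_residual: float = 0.5) -> list:
    """Return compounds that leave at most `max_residual` of control activity.

    `activity_with_compound` maps a compound name to the RdRP activity
    measured in its presence, in the same units as `control_activity`.
    """
    if control_activity <= 0:
        raise ValueError("control activity must be positive")
    hits = []
    for name, activity in activity_with_compound.items():
        residual = activity / control_activity  # fraction of activity retained
        if residual <= max_residual:
            hits.append(name)
    return sorted(hits)


# Invented placeholder values, in arbitrary units.
example = {"compound_A": 95.0, "compound_B": 12.0, "compound_C": 48.0}
print(find_inhibitors(example, control_activity=100.0))
```

A stricter or looser cutoff can be passed as `max_residual`, depending on how strongly a compound must reduce RNA-directed RNA synthesis to count as an inhibitor.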

As discussed in the background section of the description of the present
invention, the RdRP synthesizes "antisense" RNAs from "sense" RNA
templates. Said "antisense" RNAs can hybridize to complementary parts of
mRNAs or invading virus RNAs, which would lead to degradation of the
double-stranded regions so formed. Furthermore, said "antisense" RNAs may
bind to DNA, for example, via triple helix formation and thereby block gene
expression. Thus, it is possible to suppress the expression of target nucleic
acid molecules and preferably genes by introduction of template nucleic acid
molecules transcribed by RdRP into host cells which either naturally express
an RdRP or which were genetically engineered for the expression or
overexpression of the polypeptide of the invention. The degree of "antisense"
RNA production may depend on how efficiently said template nucleic acid
molecules serve as a template for the RdRP. With the aid of the RdRP of the
present invention, it is now possible to determine the appropriate templates.
Thus, in a further embodiment, the present invention relates to a method for 
determining whether a nucleic acid molecule is capable of serving as a 
template for RNA-directed RNA synthesis comprising: 

(a) contacting the polypeptide of the invention with a preferably single
stranded nucleic acid molecule; and

(b) determining whether the complementary strand of said nucleic acid
molecule is synthesized.

The term "template" in the sense of the present invention relates to nucleic
acid molecules of at least 15 nucleotides in length which are capable of
serving as a template for the polypeptide of the invention. In other words, said
template can be transcribed into complementary RNA copies, preferably up to
the full length of these templates. Said template nucleic acid molecule may be
single stranded RNA or preferably single stranded DNA or a hybrid thereof.
Furthermore, said template nucleic acid molecule may contain, for example,
thioester bonds and/or nucleotide analogs commonly used in oligonucleotide
antisense approaches. Such modifications may be useful for the stabilisation
of the template nucleic acid molecule against endo- and/or exonucleases.
The template nucleic acid molecules may also be transcribed from an
appropriate vector containing a chimeric gene which allows for the
transcription of a template RNA in prokaryotic and/or eukaryotic cells.

Moreover, the present invention relates to a pharmaceutical composition 
comprising at least one of the aforementioned nucleic acid molecules, 
vectors, polypeptides, antibodies and/or antagonists/inhibitors according to
the invention either alone or in combination, and optionally a pharmaceutically
acceptable carrier or excipient. Examples of suitable pharmaceutical carriers
are well known in the art and include phosphate buffered saline solutions,
water, emulsions, such as oil/water emulsions, various types of wetting
agents, sterile solutions etc. Compositions comprising such carriers can be
formulated by well known conventional methods. These pharmaceutical
compositions can be administered to the subject at a suitable dose.
Administration of the suitable compositions may be effected in different ways,
e.g., by intravenous, intraperitoneal, subcutaneous, intramuscular, topical or
intradermal administration.

Although the RdRP mechanism has so far only been described for plants, it is now
possible to employ the nucleic acid molecules, vectors and polypeptides of
the invention for the suppression of undesired gene expression in humans
and animals. For example, the possibility to express the RdRP in almost any
organism opens up the chance to use this enzyme as a therapeutic agent for
the control of, e.g., cancer and of virus infection in humans and animals. Even
if a general RdRP-like mechanism is not restricted to plants, it is expected
that enhancement of RNA-mediated gene silencing or suppression could
suppress growth of those cells which show highly increased transcriptional
activity. This enhancement can be achieved by introducing additional copies
of the RdRP gene or by direct transfer of in vitro produced RdRP polypeptide,
optionally together with a template nucleic acid molecule derived from the
gene to be silenced or suppressed.

An additional medical application against undesired gene expression, e.g.,
virus infections and cancer, is given when an active RdRP enzyme is
produced in vitro. There are more and more attempts to cure virus infections
and cancer with the aid of the "antisense" RNA technique, in which synthetic
"antisense" oligonucleotides are injected into the corresponding cells. This
therapy mainly depends on the choice of the oligonucleotide. It is hardly
predictable whether chosen oligonucleotides are stable in the cytoplasm of
the target cell and whether they are efficiently directed to an accessible region
of the target RNA. Moreover, high amounts of "antisense" RNAs have to be
injected to ensure that the target RNA is completely degraded.
A more promising approach could be performed by activating the endogenous
"RdRP system" with appropriate "sense" RNA, if such a system exists in the
organism. This approach has major advantages. Useful
"sense" nucleic acid molecules can be screened for by an in vitro RdRP
assay as, for example, described above, in which the suitability of said
molecules to serve as an efficient template for the enzyme is analysed. If a
suitable template is isolated, only low copy numbers of the template nucleic
acid molecule have to be successfully transferred into the target cells. The
endogenous RdRP should amplify the corresponding "antisense" RNA which
will subsequently induce post-transcriptional gene silencing.

The present invention also relates to diagnostic compositions or kits
comprising at least one of the aforementioned nucleic acid molecules,
vectors, polypeptides, antibodies and/or antagonists/inhibitors, and in the
case of diagnostic compositions optionally suitable means for detection.
The various compounds comprised in the kit and the diagnostic composition
of the invention are preferably bottled in different containers.
Said diagnostic compositions may be used in methods for detecting
expression of a polypeptide of the invention by detecting the presence of
mRNA encoding a polypeptide having RdRP activity, which comprise
isolation of mRNA from a cell and contacting the mRNA so obtained with a
probe comprising a nucleic acid probe as described above under hybridizing
conditions, detecting the presence of mRNA hybridized to the probe, and
thereby detecting the expression of the polypeptide by the cell.
Further methods of detecting the presence of a polypeptide according to the
present invention comprise immunotechniques well known in the art, for
example enzyme linked immunosorbent assays.

Moreover, the present invention relates to the use of and methods employing 
a nucleic acid molecule, vector, and/or polypeptide of the invention and 
optionally or preferably a template nucleic acid molecule for treating a disease 
which is caused by the undesired expression or overexpression of a gene.

It is envisaged by the present invention that the nucleic acid molecules and
polypeptides are administered either alone or in any combination, and
optionally together with an appropriate RdRP template nucleic acid molecule
described above, and/or together with a pharmaceutically acceptable carrier
or excipient. Subsequent to administration, said nucleic acid molecules may
be stably integrated into the genome of the mammal. On the other hand, viral
vectors may be used which are specific for certain cells or tissues and which
persist in said cells, thereby conferring expression of the nucleic acid
molecules in said cells. Suitable pharmaceutical carriers and excipients are
well known in the art. Elements capable of targeting a nucleic acid molecule
and/or polypeptides to specific cells are described in the prior art, for
example, Somia et al., Proc. Natl. Acad. Sci. USA 92 (1995), 7570-7574. The
pharmaceutical compositions prepared according to the invention can be
used for the prevention or treatment or delaying of different kinds of diseases,
which are related to the expression or overexpression of a given gene or
genes.

Furthermore, it is possible to use a pharmaceutical composition of the
invention which comprises a nucleic acid molecule or vector of the invention,
and optionally a template nucleic acid molecule, in gene therapy. Naturally,
both nucleic acid molecules may also be comprised in the same vector. For
example, research pertaining to gene transfer into cells of the germ line is one
of the fastest growing fields in reproductive biology. Gene therapy, which is
based on introducing therapeutic genes into cells by ex vivo or in vivo
techniques, is one of the most important applications of gene transfer. Suitable
vectors and methods for in vitro or in vivo gene therapy are described in the
literature and are known to the person skilled in the art. The pharmaceutical
compositions according to the invention can be used for the treatment of
kinds of diseases hitherto unknown as being related to the expression and/or
overexpression of genes.

It is to be understood that the introduced nucleic acid molecules express an
enzymatically active polypeptide of the invention after introduction into said
cell and preferably remain in this status during the lifetime of said cell. On the
other hand, the person skilled in the art may also use the nucleic acid
molecules of the invention to "knock out" an endogenous gene encoding an
RdRP, for example, by gene targeting or antisense or ribozyme technology.

In a further preferred embodiment of the method of the invention, said cell is a
germ line cell or embryonic cell or derived therefrom. In a most preferred
embodiment, said cell is an egg cell or derived therefrom.

In a further embodiment the invention relates to a method for treating a
disease caused by the undesired expression or overexpression of a nucleic
acid molecule comprising:

(a) obtaining cells from a mammal;

(b) introduction of a nucleic acid molecule or a vector of the invention and
optionally a template nucleic acid molecule derived from said nucleic
acid molecule which causes the disease into said cells; and

(c) reintroducing the cells obtained as a product of step (b) into said
mammal or into a mammal of the same species.

Said template nucleic acid molecule is preferably identified or determined by
methods described hereinabove. Preferably, said mammal is a human, rat or
mouse and/or the cell is a germ cell, an embryonic cell or an egg cell or a cell
derived therefrom.

The present invention also relates to a method for the production of a
transgenic mammal comprising introduction of a nucleic acid molecule or
vector of the invention and optionally a template nucleic acid molecule into a
germ cell, an embryonic cell or an egg or a cell derived therefrom. The
invention also relates to transgenic animals obtained by said method.

Further, the present invention relates to transgenic mammalian cells,
comprising stably integrated into the genome

(a) a nucleic acid molecule encoding a polypeptide of the invention, which
is linked to regulatory elements allowing transcription and/or
expression of the nucleic acid molecule in mammalian cells and/or

(b) a template nucleic acid molecule determined by the method of the
invention which is linked to regulatory elements allowing transcription
of said template nucleic acid molecule in mammalian cells.

In a further embodiment the invention relates to transgenic mammals
comprising the above described mammalian cells.

In a further embodiment the present invention relates to a mammalian cell
which contains stably integrated into the genome a nucleic acid molecule of
the invention which is linked to regulatory elements allowing transcription of
said nucleic acid molecule in mammalian cells wherein the presence of said
nucleic acid molecule and/or the transcription and/or expression leads to
reduction of the synthesis of the polypeptide of the invention in said cells.

In a preferred embodiment the reduction of the synthesis of the polypeptide of
the invention in said cells is achieved by an antisense, ribozyme and/or
co-suppression effect. The present invention thus also relates to a transgenic
mammal comprising such mammalian cells.

Preferably, the mammal referred to in the above embodiments is a human, a
rat or a mouse.

Further embodiments and applications which also relate to the technology of 
transgenic mammals are discussed below with respect to transgenic plants. 
The person skilled in the art is able to apply, where appropriate, the
technologies and applications discussed in connection with transgenic plants
also to transgenic mammals.

The invention further relates to a transgenic plant cell comprising stably
integrated into the genome

(a) a nucleic acid molecule of the invention which is linked to regulatory
elements allowing transcription and/or expression of the nucleic acid
molecule in plant cells and/or

(b) a template nucleic acid molecule determined by the method of
the invention which is linked to regulatory elements allowing
transcription of said template nucleic acid molecule in plant cells.

The nucleic acid molecules according to the invention are in particular useful
for the genetic manipulation of plant cells in order to increase or decrease the
RdRP activity in plants and, by transferring the "RdRP system" to organisms
that lack a comparable mechanism, to obtain plants with modified, preferably
with improved or useful phenotypes.

For example, although a threefold increased concentration of the RdRP
protein can be obtained by virus or viroid infection in tomato, the enzyme is
still hardly detectable in those plants. Therefore it is expected that resistance
of plants against RNA viruses can be improved by increasing the RdRP
activity. For this purpose it is preferred to first introduce a functional
full-length RdRP cDNA copy into the genome of several crop plants.
Regenerated transformants that have been screened for high RdRP activity
have to be retransformed with a second construct to finally induce the
RNA-mediated resistance against a particular virus. This transgene construct
should provide in vivo transcripts which have to be homologous to a part of
the virus RNA. In plants carrying both transgenes the improved resistance is
based on the phenomenon that the increased RdRP activity leads to highly
efficient production of "antisense" RNA. This would also include increased
transcription of "antisense" RNAs from the nuclear-encoded virus-specific
templates. These virus-specific "antisense" RNAs should be capable of
inducing the sequence-specific RNA degradation process by hybridizing to
the invading virus RNA.
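
The template relationship described above can be illustrated with a minimal sketch: copying a "sense" RNA template by complementary base pairing yields the reverse complement, i.e. the "antisense" strand that is able to hybridize to the target RNA. The function name and the example sequence are hypothetical and chosen for illustration only; they are not sequences of the invention.

```python
# Watson-Crick base pairing for RNA.
RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def antisense_rna(sense: str) -> str:
    """Return the antisense strand, written 5'->3', of a sense RNA.

    The antisense strand is the reverse complement of the template.
    Raises KeyError if the input contains a non-RNA character.
    """
    return "".join(RNA_COMPLEMENT[base] for base in reversed(sense.upper()))


# Hypothetical template fragment (illustration only).
example_sense = "AUGGCUUAC"
example_antisense = antisense_rna(example_sense)
```

Applying the function twice returns the original sequence, which reflects that the antisense product is fully complementary to its template.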

The same gene silencing system could be applied to inactivate any
endogenous gene in possibly all higher eukaryotes. In case an efficient
RdRP activity can be introduced into a particular organism it should be
possible to down-regulate genes by re- or cotransforming with transgene
constructs that encode the corresponding RdRP templates.
Thus, the present invention also relates to transgenic plant or mammalian
cells which contain stably integrated into the genome a nucleic acid molecule
according to the invention linked to regulatory elements which allow for
expression of the nucleic acid molecule in plant or mammalian cells and
wherein the nucleic acid molecule is foreign to the transgenic plant or
mammalian cell, and optionally a template nucleic acid molecule. For the
meaning of the term "foreign", we refer to our definition provided
hereinabove.

Further, the present invention also relates to transgenic plants or mammals
comprising transgenic plant or mammalian cells according to the invention.

Furthermore, the invention relates to the transgenic plant cell which contains
stably integrated into the genome a nucleic acid molecule according to the
invention or part thereof, wherein the transcription and/or expression of the
nucleic acid molecule or part thereof leads to reduction of the synthesis of a
polypeptide having RdRP activity.

In a preferred embodiment, the reduction is achieved by an antisense,
ribozyme and/or co-suppression effect.

The provision of the nucleic acid molecules according to the invention opens
up the possibility to produce transgenic plant cells with a reduced level of the
polypeptide as described above and, thus, to inactivate antisense RNA
production. Techniques to achieve this are well known to the person
skilled in the art. Experiments described in the prior art indicate that
overexpressed transgenes as well as several tissue-specific or
developmentally regulated endogenous plant genes can be
post-transcriptionally silenced by the RdRP system. Therefore it should be possible
to inactivate this putative protection mechanism by decreasing the RdRP
activity. This inactivation could be performed, for example, by "antisensing"
the RdRP mRNA, or by ribozymes, molecules which combine antisense and
ribozyme functions, molecules which provide for cosuppression effects, or by
tagging the plant RdRP gene. For both purposes the knowledge of the RdRP
cDNA or RdRP gene sequence is a prerequisite. In order to
post-transcriptionally silence the RdRP gene, real "antisense" transgene constructs
have to be designed. It is easy to understand that a "sense" transgene
construct does not make sense because the enzyme that is to be inactivated
is exactly the enzyme that can synthesize "antisense" RNA from "sense"
RNA templates. The "antisense approach" to decrease the RdRP protein
concentration could have the advantage of not completely destroying RdRP
activity. In case the RdRP plays an essential part in healthy plants, the total
loss of RdRP would be lethal. Depending on the promoter driving a
transgenic "antisense" construct or on the transgene integration
locus, plants could be isolated that show lower RdRP levels than wild-type
plants. To analyse whether plants are able to survive without any RdRP
activity, the RdRP gene can be knocked out by gene tagging experiments.
Foreign DNAs, as for example the T-DNAs of Agrobacterium tumefaciens,
integrate into the plant genome at random loci. Provided that the sequence of
the RdRP gene is known, plants can be screened for T-DNA insertions at the
RdRP gene locus by PCR (Koes et al., 1992). Plants in which successful
tagging of the RdRP gene occurred should survive because in a diploid plant
only one allele is inactivated. In case RdRP-minus plants show normal
growth, progeny homozygous for the insertion allele are obtained after selfing.

When using the antisense approach for reduction of the above described
enzymatic activity in plant cells, the nucleic acid molecule encoding the
antisense RNA is preferably of homologous origin with respect to the plant
species used for transformation. However, it is also possible to use
heterologous nucleic acid molecules which display a high degree of homology
to endogenously occurring nucleic acid molecules encoding a polypeptide
with the respective enzymatic activity. In this case the homology is preferably
higher than 80%, particularly higher than 90% and still more preferably higher
than 95%.
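
The homology thresholds above can be checked with a simple calculation. The sketch below assumes that "homology" is measured as the percentage of identical positions between two sequences that have already been aligned to equal length (gaps written as "-"); the text does not prescribe a particular alignment or scoring method, so this measure, the function names and the example sequences are illustrative assumptions.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of aligned columns carrying the same residue.

    Both inputs must be pre-aligned and of equal length; columns that
    contain a gap ('-') never count as a match.
    """
    if not aligned_a or len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be non-empty and of equal length")
    pairs = zip(aligned_a.upper(), aligned_b.upper())
    matches = sum(1 for x, y in pairs if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def homology_tier(identity: float) -> str:
    """Map a percent identity onto the preference tiers named in the text."""
    if identity > 95.0:
        return "higher than 95% (still more preferred)"
    if identity > 90.0:
        return "higher than 90% (particularly preferred)"
    if identity > 80.0:
        return "higher than 80% (preferred)"
    return "80% or lower"


# Hypothetical aligned fragments differing at one of ten positions.
identity = percent_identity("ACGTACGTAC", "ACGTACGTAA")
tier = homology_tier(identity)
```

Note that a value of exactly 90% falls into the "higher than 80%" tier, because the text requires the homology to be strictly higher than each threshold.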

For the expression of the nucleic acid molecules according to the invention in
sense or antisense orientation in plant cells, the molecules are placed under
the control of regulatory elements which ensure the expression in plant cells.
These regulatory elements may be heterologous or homologous with respect
to the nucleic acid molecule to be expressed as well as with respect to the plant
species to be transformed. In general, such regulatory elements comprise a
promoter active in plant cells. To obtain expression in all tissues of a
transgenic plant, preferably constitutive promoters are used, such as the 35S
promoter of CaMV (Odell et al., Nature 313 (1985), 810-812) or promoters of
the polyubiquitin genes of maize (Christensen et al., Plant Mol. Biol. 18
(1992), 675-689). In order to achieve expression in specific tissues of a
transgenic plant it is possible to use tissue specific promoters (see, e.g.,
Stockhaus et al., EMBO J. 8 (1989), 2245-2251). Also known are promoters
which are specifically active in tubers of potatoes or in seeds of different
plant species, such as maize, Vicia, wheat, barley etc. Inducible promoters
may be used in order to be able to exactly control expression. Examples of
inducible promoters are the promoters of genes encoding heat shock
proteins.

The regulatory elements may further comprise transcriptional and/or
translational enhancers functional in plant cells.

Furthermore, the regulatory elements may include transcription termination
signals, such as a poly-A signal, which lead to the addition of a poly-A tail to
the transcript which may improve its stability.

In the case that a nucleic acid molecule according to the invention is
expressed in sense orientation it is in principle possible to modify the coding
sequence in such a way that the polypeptide is located in any desired
compartment of the plant cell. These include the endoplasmic reticulum, the
vacuole, the mitochondria, the plastids, the apoplast, the cytoplasm etc.
Methods to carry out these modifications and signal sequences ensuring
localization in a desired compartment are well known to the person skilled in
the art.

Methods for the introduction of foreign DNA into plants are also well known in
the art. These include, for example, the transformation of plant cells or tissues
with T-DNA using Agrobacterium tumefaciens or Agrobacterium rhizogenes
(EP-A 120 516; EP-A 116 718; Hoekema in: The Binary Plant Vector System,
Offsetdrukkerij Kanters BV, Alblasserdam (1985), Chapter V; Fraley et al.,
Crit. Rev. Plant Sci. 4, 1-46 and An et al., EMBO J. 4 (1985), 277-287), the
fusion of protoplasts, direct gene transfer (see, e.g., EP-A 164 575), injection,
electroporation, biolistic methods like particle bombardment and other
methods known in the art.

The transformation of most dicotyledonous plants is possible with the
methods described above. But also for the transformation of
monocotyledonous plants several successful transformation techniques have
been developed. These include the transformation using biolistic methods
(Wan and Lemaux, Plant Physiol. 104 (1994), 37-48; Vasil et al.,
Bio/Technology 11 (1993), 1553-1558), protoplast transformation,
electroporation of partially permeabilized cells, introduction of DNA using
glass fibers, etc.

The reduction of the synthesis of a polypeptide according to the invention in 
the transgenic plant cells results in an alteration in the modulation of gene 
expression. In transgenic plants comprising such cells this can lead to various 
physiological, developmental and/or morphological changes. 

The present invention also relates to transgenic plants comprising the
above-described transgenic plant cells.

The present invention also relates to cultured plant tissues comprising 
transgenic plant cells as described above which either show overexpression 
of a polypeptide according to the invention or a reduction in synthesis of such 
a polypeptide. 

In yet another aspect the invention also relates to harvestable parts and to
propagation material of the transgenic plants according to the invention which
either contain transgenic plant cells producing a polypeptide according to the
invention (and, if an endogenous RdRP gene is present in the plant, above the
wild type level of RdRP in said plant) or which contain cells which show a
reduced activity of the described polypeptide. Harvestable parts can be in
principle any useful parts of a plant, for example, leaves, stems, fruit, seeds,
roots etc.

Propagation material includes, for example, seeds, fruits, cuttings, seedlings,
tubers, rootstocks etc.

In general, the plants which can be modified according to the invention and
which either show overexpression of a polypeptide according to the invention
or a reduction of the synthesis of such a polypeptide can be derived from any
desired plant species. They can be monocotyledonous plants or
dicotyledonous plants; preferably they belong to plant species of interest in
agriculture, wood culture or horticulture, such as crop plants (e.g.
maize, rice, barley, wheat, rye, oats etc.), potatoes, oil producing plants (e.g.
oilseed rape, sunflower, peanut, soybean, etc.), cotton, sugar beet, sugar
cane, leguminous plants (e.g. beans, peas etc.), wood producing plants,
preferably trees, etc.

It is furthermore possible to use the nucleic acid molecules, vectors and/or
polypeptides of the invention and, optionally, an appropriate RdRP template
nucleic acid molecule which may be identified by the afore-described method
for inhibiting expression of any desired gene by transferring the RdRP system
to organisms that either lack a comparable mechanism or do not sufficiently
express their own RdRP. Furthermore, the nucleic acid molecules, vectors,
antibodies and/or antagonists/inhibitors of the present invention can be used
for inhibiting RNA-directed RNA synthesis and thereby conferring stable
heterologous gene expression in transgenic organisms.

Besides the above-described possibilities to use the nucleic acid molecules,
vectors and polypeptides according to the invention for e.g. the genetic
engineering of organisms and their use to identify homologous molecules, the
described nucleic acid molecules may also be used for several other
applications, for example, for the identification of nucleic acid molecules which
encode polypeptides which interact with the RdRP described above. This can
be achieved by assays well known in the art, for example, by use of the
so-called yeast "two-hybrid system". In this system the polypeptide encoded by
the nucleic acid molecules according to the invention or a smaller part thereof
is linked to the DNA-binding domain of the GAL4 transcription factor. A yeast
strain expressing this fusion protein and comprising a lacZ reporter gene
driven by an appropriate promoter, which is recognized by the GAL4
transcription factor, is transformed with a library of cDNAs which will express
plant proteins fused to an activation domain. Thus, if a polypeptide encoded
by one of the cDNAs is able to interact with the fusion protein comprising at
least part of the RdRP polypeptide, the complex is able to direct expression of the
reporter gene. In this way the nucleic acid molecules according to the
invention and the encoded RdRP can be used to identify proteins interacting
with the RdRP, such as protein kinases, transcription factors and the like.
Other methods for identifying proteins which interact with the proteins
according to the invention or nucleic acid molecules encoding such molecules
are, for example, in vitro screening with the phage display system as well
as filter binding assays.

Description of the Figures

Figure 1

15 µg of genomic tomato DNA (cv. Rentita) were cut with EcoR1 (lane 1),
BamH1 (lane 2), Hind3 (lane 3), Xba1 (lane 5), and Hae3 (lane 6),
respectively, (5 U/µg) and electrophoresed in a 13 x 18 cm 0.8% TEA-agarose
gel. The DNA was then vacuum-blotted onto a nylon membrane and
hybridized against a random primed 32P-labeled RdRP-specific cDNA fragment
as a probe.

The membrane was autoradiographed for 16 h at -80°C using a BIOMAX™
MR film (Kodak) and a Quanta III intensifying screen (Du Pont).
Lane 4: molecular weight standard, λ-DNA cut with Pst 1.

Figure 2 

15 µg of genomic DNA isolated from potato (lane 1), tobacco (lane 2), and the
two tomato cultivars "St. Pierre" (lane 4) and "Rentita" (lane 5) were cut with
Hind3 (5 U/µg) and electrophoresed in a 13 x 18 cm 0.8% TEA-agarose gel.
The DNA was then vacuum-blotted onto a nylon membrane and hybridized
against a random primed 32P-labeled RdRP-specific cDNA fragment as a
probe.

The membrane was at first autoradiographed for 16 h to visualize lanes 1, 4,
and 5 and then for 72 h to visualize lane 2 at -80°C using a BIOMAX™ MR
film (Kodak) and a Quanta III intensifying screen (Du Pont).
Lane 4: molecular weight standard, λ-DNA cut with Pst 1.

Figure 3 

15 µg of total RNA isolated from PSTVd-infected (+) and viroid-free (-) tomato
plants (cv. Rutgers and cv. Basket Pak, respectively), were denatured by
glyoxalation and electrophoresed in a 13 x 18 cm 1.5% phosphate-buffered
agarose gel. The RNA was then capillary-blotted onto a nylon membrane and
hybridized against a random primed 32P-labeled RdRP-specific cDNA fragment
as a probe.

The membrane was autoradiographed for 16 h at -80°C using a BIOMAX™
MR film (Kodak) and a Quanta III intensifying screen (Du Pont).

Figure 4 

1 µl aliquots of elution fractions from 4 different protein purification columns
(lane 2 and 3: Q Sepharose FF, lane 4: Poly(A) Sepharose, lane 5: Poly(U)
Sepharose, and lane 8: Heparin Sepharose) that contained the highest RdRP
activity were applied onto SDS-PAGE (Phast system, Pharmacia) and were
then subjected to Western analysis. Immuno-detection was performed with
the RdRP-specific antibody Ao^^i.

Lane M: molecular weight protein standard (205, 116, 97, and 58 kDa
proteins).

Lane 1: 1 µl aliquot of the protein extract that was subsequently loaded onto
the Q Sepharose FF column.

Numbers in parentheses correspond to the numbers of two Q
Sepharose FF elution fractions.

Figure 5 

1 µl aliquots of the Sephadex G-200 gel filtration fractions [lane 2: retention
volume fraction (rvf) 12, lane 3: rvf 14, lane 4: rvf 15, lane 5: rvf 17, and lane
6: rvf 19] were applied onto SDS-PAGE (Phast system, Pharmacia) and were
then subjected to Western analysis. Immuno-detection was performed with
the RdRP-specific antibody Ap^^.

Lane 1: 1 µl aliquot of the protein extract that was applied to the column
(ON).

Figure 6 

Design of RdRP-specific PCR primers.

Description of the Invention

Example 1: RdRP purification

The concentration of plant-encoded RdRP in non-infected and apparently
healthy plants is rather low. From 1 kg of apical tomato leaves, corresponding
to about 1000 tomato plants, only about 10 to 20 µg of a protein preparation
having RdRP activity could be isolated (Schiebel et al., 1993a). This low
concentration is also reflected by the fact that the RdRP preparation from
tomato leaves had to be concentrated about 100,000-fold to obtain at least an
electrophoretically homogeneous protein preparation (Schiebel et al., 1993a).
Micro-sequencing of protein preparations purified according to the method of
Schiebel et al. (1993a) gave no reliable sequence information.

Thus, a new purification method had to be applied which was performed as
follows. RdRP was solubilized from freshly harvested apical leaves of tomato
plants (Lycopersicon esculentum, cultivar 'Rentita'). The plants, which were
grown under semi-controlled greenhouse conditions (25°C, 15-h photoperiod),
were systemically infected (Tabler and Sanger, 1985) with the PSTVd type
strain (Gross et al., 1978) at the two leaf stage, i.e. about 4 weeks after sowing
(Tabler and Sanger, 1984). Only the young and still unexpanded leaves from
the shoot tips (about 1 g per plant) were harvested three to four weeks after
inoculation.

Portions of 100 g each were homogenized with 500 ml of buffer A (50 mM
Tris-acetate, pH 7.4, 10 mM K-acetate, 1 mM EDTA, 10 mM
2-mercaptoethanol, 0.5 mM PMSF) in a Waring type blender (Braunmixer MX
32) for 2 minutes at position 3. The homogenate was strained through 135 µm
nylon sieve cloth (Nytal, Schweizer Seidengazefabrik, CH-9425 Thal
SG/Swiss) and the retentate was rehomogenized with 220 ml of buffer A. The
combined filtrates from 200 g of leaf material were centrifuged at 1,000 x g for
15 minutes (Beckman Rotor J8B). The supernatant was mixed with 0.3
volumes of 86% glycerol and the mixture centrifuged at 30,000 x g for 1 h in a
Beckman type 19 rotor. The supernatant containing the "soluble RdRP"
(RdRP-s) and the majority of TNTase was used for ammonium sulfate
precipitation.

The 30,000 x g pellet containing RdRP was resuspended in 100 ml of buffer B
(50 mM Tris-acetate, pH 8.2, 50 mM K-acetate, 1 mM EDTA, 25% (v/v)
glycerol (Ph. Eur. III), 10 mM 2-mercaptoethanol, 0.5 mM PMSF) using a
Dounce homogenizer. The suspension was slowly stirred for 30 minutes and
centrifuged for 50 min at 40,000 rpm (146,000 x g) in a Beckman rotor 50.2
Ti. The supernatant represents extract E1. The pellet was reextracted twice
with 0.5 ml/g buffer B resulting in extract E2. Extracts E1 and E2 containing
the "membrane-bound RdRP" (RdRP-b) were shock-frozen in solid CO2 and
stored at -70°C for loading onto a DEAE-Sepharose column.
The 30,000 x g supernatant was mixed with the same volume of 3.95 M
ammonium sulfate solution. After stirring for one hour at 4°C, the suspension
was centrifuged at 10,000 x g for 30 min. The pellet was dissolved with 40 ml
of buffer C (25 mM Tris-acetate, pH 8.2, 1 mM EDTA, 20% glycerol, 3 mM
2-mercaptoethanol), dialyzed overnight against 1 liter of buffer C, cleared by
centrifugation at 10,000 x g for 30 min, shock-frozen in solid CO2 and kept at
-70°C for chromatography.
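
The mixing steps in this protocol imply final concentrations that are not stated explicitly. As a minimal sketch, and assuming that volumes are additive and that the sample itself contributes none of the added component, they can be estimated as follows. The function name is hypothetical; the input values are those given in the text (one volume of 3.95 M ammonium sulfate per volume of supernatant, and 0.3 volumes of 86% glycerol per volume of supernatant).

```python
def final_concentration(stock_conc: float, stock_vol: float, sample_vol: float) -> float:
    """Concentration of an additive after mixing a stock with a sample.

    Assumes additive volumes and a sample that initially contains none
    of the additive. The result has the same unit as stock_conc.
    """
    if stock_vol < 0 or sample_vol < 0 or stock_vol + sample_vol == 0:
        raise ValueError("volumes must be non-negative and not both zero")
    return stock_conc * stock_vol / (stock_vol + sample_vol)


# Equal volumes of supernatant and 3.95 M ammonium sulfate solution.
ammonium_sulfate_molar = final_concentration(3.95, stock_vol=1.0, sample_vol=1.0)

# 0.3 volumes of 86% glycerol added to one volume of supernatant.
glycerol_percent = final_concentration(86.0, stock_vol=0.3, sample_vol=1.0)
```

Under these assumptions the precipitation takes place at about 1.98 M ammonium sulfate, and the glycerol step yields roughly 20% glycerol, which is close to the glycerol content of buffers B and C.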

Both supernatant and extracts from the pellet were assayed for RdRP activity
as follows. The assay for RdRP activity is based on the incorporation of
radioactively labeled 5'-nucleoside monophosphates from the corresponding
[alpha-32P]NTPs into transcription products of a given RNA template. This
assay is not RdRP-specific because the ubiquitous terminal nucleotidyl
transferases (TNTases) are able to terminally incorporate radioactive
ribonucleotides into polymeric RNA products. Such TNTases are RNA
uridylyl-transferase (Zabel et al., 1981) (EC 2.7.7.52), polyadenylyl-transferase
(Edmonds, 1982) (EC 2.7.7.19) or other TNTases (Boege, 1982)
which usually contaminate RdRP preparations. No specific direct assays are
available, as yet, that allow the unambiguous discrimination between the true
template-directed transcription by RdRP and the terminal addition of
nucleotides to the designated template RNA by TNTases. Consequently,
RdRP and contaminating TNTases had to be separated from each other
before their individual activities could be quantitated.

One unit of enzyme activity is defined as the amount of enzyme which 
catalyzes the incorporation of 1 nmol 5'-uridine monophosphate into high 
molecular weight RNA within 30 min. If not stated otherwise, the assay was 
performed in prelubricated 1.7-ml test tubes (Sorenson Bioscience, Inc., 
Multi™ Liquid Handling Products, Salt Lake City, Utah, USA) in a final volume 
of 25 µl containing 50 mM Tris-acetate (pH 7.8), 10 mM Mg-acetate, 2 mM 
dithioerythritol, 0.3 mM each of ATP, CTP, GTP, 0.02 mM [alpha-32P]UTP 
(adjusted to 0.5 Ci/mmol), 1 µg TMV RNA, 0.01% Tween 20, and 2 or 5 µl of 
RNA polymerase preparation. Reactions were initiated by adding the enzyme. 
After 30 min of incubation at 37°C, the reaction tubes were transferred to ice. 
After mixing with 15 µl of 13 mM UTP, 10 µl of water was added as a wash 
solution and the tube contents were spotted on Whatman 3MM paper strips (2 
x 10 cm) before descending chromatography was performed in 2 M Na-acetate, 
pH 5.2/ethanol = 1/1. The radioactivity of the RNA products 
remaining at the origin of the chromatogram was determined by liquid 
scintillation counting (6 ml cocktail Rotiszint eco plus, Roth) and used for the 
calculation of enzyme activity. Under the given conditions and 5 µl 
RdRP/assay, a concentration of one unit per milliliter corresponded to 4800 
cpm (Schiebel et al., 1993b). This assay was used throughout the examples 
unless otherwise stated. 
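
The conversion from counted radioactivity to enzyme concentration can be sketched as follows. This is a minimal illustration, not the authors' procedure: it uses the stated calibration (4800 cpm corresponds to 1 U/ml with 5 µl of enzyme per assay), assumes that the signal scales linearly with the enzyme volume, and introduces a hypothetical background correction that the text does not mention.

```python
# Calibration taken from the text: 4800 cpm corresponds to 1 U/ml
# when 5 µl of RdRP preparation is used per assay.
CPM_PER_UNIT_PER_ML = 4800.0
REFERENCE_ENZYME_UL = 5.0


def rdrp_units_per_ml(cpm, enzyme_ul=REFERENCE_ENZYME_UL, background_cpm=0.0):
    """Estimate RdRP concentration (U/ml) from scintillation counts.

    Assumptions (not stated in the source): the counts scale linearly with
    the enzyme volume, and background_cpm is a hypothetical blank value.
    """
    if enzyme_ul <= 0:
        raise ValueError("enzyme volume must be positive")
    net_cpm = cpm - background_cpm
    # Normalise to the 5 µl reference volume used for the calibration.
    return (net_cpm / CPM_PER_UNIT_PER_ML) * (REFERENCE_ENZYME_UL / enzyme_ul)


if __name__ == "__main__":
    print(rdrp_units_per_ml(4800))               # 5 µl assay
    print(rdrp_units_per_ml(4800, enzyme_ul=2))  # 2 µl assay, scaled up
```

For assays run with 2 µl of enzyme, the same counts imply a proportionally higher concentration under the linearity assumption.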

The supernatant contained about 5 U per 150 g of leaves whereas the 
corresponding pellet contained about 1.4 U of extractable RdRP. 

All RdRP preparations were first chromatographed on DEAE-Sepharose, 
which removed nucleic acids, most of the pigments and an inhibitor of RdRP 
activity. 

The extracts containing RdRP-b from the membrane-rich pellet or the 
ammonium sulfate-fractionated 30,000 x g supernatant were loaded onto a 
column of DEAE-Sepharose Fast Flow (FPLC system, Pharmacia; 5 cm x 10 
cm) with a flow rate of 6 to 15 cm/h (2 to 5 ml/min) using a peristaltic pump. 
After eluting non-adsorbed material with about 70 ml of buffer C, a linear 
gradient of 0.0 to 0.7 M ammonium acetate in buffer C was applied with a flow 
rate of 3 to 12 cm/h (1 to 4 ml/min) depending on the pressure limit of about 
0.5 MPa. The volume of the gradient was 500 ml up to 0.5 M salt and 100 ml 
for the higher salt concentrations. Fractions of 20 ml were collected and those 
fractions containing the main enzyme activity were pooled, shock-frozen and 
stored. 
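
The paired linear and volumetric flow rates quoted above follow from the column cross-section. The short sketch below reproduces that conversion for a column of 5 cm diameter; the function name is illustrative only.

```python
import math


def volumetric_flow_ml_per_min(linear_cm_per_h, column_diameter_cm):
    """Convert a linear flow rate (cm/h) into a volumetric one (ml/min)."""
    # Cross-sectional area of the column bed in cm^2 (1 cm^3 = 1 ml).
    area_cm2 = math.pi * (column_diameter_cm / 2.0) ** 2
    return linear_cm_per_h * area_cm2 / 60.0


if __name__ == "__main__":
    # Linear flow rates quoted for loading (6-15 cm/h) and elution (3-12 cm/h).
    for linear in (6, 15, 3, 12):
        flow = volumetric_flow_ml_per_min(linear, 5.0)
        print(f"{linear:>2} cm/h -> {flow:.2f} ml/min")
```

With a 5 cm diameter, 6 to 15 cm/h corresponds to roughly 2 to 5 ml/min and 3 to 12 cm/h to roughly 1 to 4 ml/min, in agreement with the figures given in the text.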

The enzyme preparation from the DEAE-Sepharose chromatography 
containing about 0.27 M ammonium acetate was diluted with one volume of 
buffer D (20 mM Tris-HCl pH 8.0, 0.5 mM EDTA, 1.5 mM DTE, and 0.012% 
Tween 20) and loaded onto 3 ml poly(U)-Sepharose 4B (Pharmacia) in a 1 x 
3.7 cm column (FPLC system, Pharmacia; C10/10) equilibrated with buffer D. 
A total volume of 120 ml was loaded with a flow rate of 1 ml/min. After elution 
of non-adsorbed proteins with buffer D, a linear gradient of 30 ml from 0.0 to 1 
M NaCl in buffer D was applied and fractions of 1.5 ml were collected. Those 
fractions containing enzyme activity were pooled, shock-frozen and stored. 

In order to concentrate the RdRP-containing eluate from the first poly(U)-Sepharose 
chromatography, the corresponding fractions were diluted with two 
volumes of buffer D and loaded on 0.4 ml poly(U)-Sepharose 4B in a 0.5 x 1.9 
cm column (SMART system, Pharmacia; HR 5/2) with a flow rate of about 0.1 
ml/min using a peristaltic pump. After elution of non-adsorbed proteins with 
buffer D, the column was installed into the SMART system. Elution was 
performed with sodium chloride increasing in a linear gradient up to 1.5 M. The 
flow rate during elution was 0.05 ml/min. Fractions of 0.1 ml were collected. 
Those fractions containing enzyme activity were analyzed by SDS-PAGE and 
pooled, resulting in the partially purified RdRP preparation. 
Pools from several chromatographic runs were collected. The proteins 
exhibiting an RdRP activity of 20 U in 4 ml desorption buffer (about 1 M NaCl in 
buffer D) were concentrated and concomitantly freed from salt by 
deoxycholate-trichloroacetic acid precipitation (Bensadoun and Weinstein, 
1976) in prelubricated 1.7-ml test tubes. Insulin (25 µg/ml) was used as 
coprecipitant; centrifugation was performed in a swinging bucket rotor at 
11,000 rpm for 30 minutes. The resulting pellets were washed with ethanol at 
-20°C and used for analytical SDS-PAGE (Phast system, Pharmacia). The 
estimated amount of RdRP protein was about 5 µg. 

Example 2: Microsequencing of the RdRP proteins 

The precipitated protein was prepared for SDS-PAGE, applied onto a 7% 
polyacrylamide gel (Laemmli) and stained with Coomassie Brilliant Blue R250 
(Sambrook et al., 1989). The 128 kDa band was excised and cleaved in the 
gel according to Eckerskorn and Lottspeich (1989) except that, instead of 
trypsin, endoprotease LysC (Boehringer Mannheim) with an enzyme:protein 
ratio of 1:10 (w/w) was used. The peptides separated by reversed phase 
HPLC were sequenced using a 492A amino acid sequencer (Applied 
Biosystems) according to the manufacturer's instructions. 

Example 3: Isolation of the RdRP cDNA 

Microsequencing of the RdRP protein resulted in only four definite amino acid 
sequences from which the corresponding nucleic acid sequence could be 
deduced. In Fig. 6 the sequence of two of the four peptides and the 
sequence of four of the corresponding synthetic oligonucleotides that had 
been designed for PCR experiments are presented. 

In order to perform PCR with the RdRP-specific primers, cDNA had been 
synthesized from tomato mRNA. For mRNA isolation, 20 g of young leaves 
were harvested from potato spindle tuber viroid- (PSTVd) infected 'Rutgers' 
tomato plants that were grown in the greenhouse (see above). The plant 
material was immediately frozen in liquid nitrogen, and total RNA was isolated 
as described by Logemann et al. (1987). From 5 mg of total RNA, polyA+ RNA 
was isolated using the 'PolyATtract mRNA Isolation System I' (Promega) 
following the manufacturer's instructions. Subsequently, 1 µg of the purified 
polyA+ RNA was subjected to the 'cDNA Synthesis Kit' (Boehringer 
Mannheim) and cDNA was synthesized according to the manufacturer's 
instructions using an oligo(dT)18 primer. The reaction was stopped by phenol 
extraction and 1 µl of the sample was diluted 1:100 with TE buffer. 
PCR was performed with 1 µl of the diluted cDNA using the primer pairs A 
(RdRP/8H1-RdRP/15R1; 0.1 nmol each) and B (RdRP/15H1-RdRP/8R1; 0.1 
nmol each), respectively. Amplifications were assayed in a 100 µl reaction mix 
containing 10 µl of 10x Assay Buffer (Eurogentec), 10 µl of dNTPs (2 nmol/µl), 
and 1 µl of EuroTaq polymerase (4 U/µl) (Eurogentec). The samples were 
transferred to a 'Crocodile II Thermocycler' block (Appligene) and 30 cycles 
were started (1' 95°C, 1' 55°C, and 1' 72°C). After separation on a 0.8% TBE 
agarose gel (Sambrook et al., 1989), analysis of the PCR products revealed 
that a single DNA fragment of about 800 bp in size had been amplified with 
primer pair B. This fragment (RdRP800) was eluted from the agarose gel 
(QIAquick Gel Extraction Kit, Qiagen) and cloned (Sambrook et al., 1989) into 
the T/A-type PCR cloning vector pTPCR (Wassenegger et al., 1994) which is 
similar to the plasmid pCR™II that is delivered with the Cloning™ Kit 
(Invitrogen). 
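
As a rough check on the reaction composition, the final concentrations in the PCR mix can be worked out from the amounts added. The sketch below assumes the partly illegible volumes read as 10 µl of dNTP stock and 1 µl of polymerase in a 100 µl reaction; whether the 2 nmol/µl dNTP stock refers to each nucleotide or to the total is not stated in the text.

```python
def final_concentration_uM(amount_nmol, reaction_volume_ul):
    """Final concentration in µM for a given amount (nmol) in a volume (µl)."""
    # nmol per µl equals mmol per litre (mM); multiply by 1000 for µM.
    return amount_nmol / reaction_volume_ul * 1000.0


REACTION_VOLUME_UL = 100.0   # assumed reading of the garbled reaction volume
DNTP_STOCK_NMOL_PER_UL = 2.0
DNTP_VOLUME_UL = 10.0        # assumed reading
PRIMER_NMOL = 0.1            # per primer, as stated
TAQ_UNITS = 1.0 * 4.0        # 1 µl (assumed reading) at 4 U/µl

if __name__ == "__main__":
    dntp_nmol = DNTP_STOCK_NMOL_PER_UL * DNTP_VOLUME_UL
    print("dNTPs :", final_concentration_uM(dntp_nmol, REACTION_VOLUME_UL), "µM")
    print("primer:", final_concentration_uM(PRIMER_NMOL, REACTION_VOLUME_UL), "µM")
    print("Taq   :", TAQ_UNITS, "U per reaction")
```

Under these readings each primer is present at about 1 µM and the dNTP stock contributes about 200 µM.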

The sequence of RdRP800 was determined on an automatic sequencer 
(ALFexpress, Pharmacia Biotech) using the 'Cy5 AutoRead Sequencing Kit' 
(Pharmacia Biotech) and following the manufacturer's sequencing procedure. 
The precise length of RdRP800 was 833 bp comprising an ORF that 
corresponds to 277 amino acid residues given in SEQ ID NO:3 which 
correspond to amino acid 700 to amino acid 917 of the amino acid sequence 
of SEQ ID NO:2. 

Two custom ZAP EXPRESS™ EcoRI cDNA libraries (Stratagene) had been 
established from polyA+ RNA which had been isolated as described above 
from young leaves of the tomato cultivars 'Rutgers' and 'Basket Pak'. From 
5 to 10 µg of the purified polyA+-enriched RNA, cDNA libraries were 
constructed (Stratagene). The size-fractionated oligo(dT)-primed cDNAs 
(>500 bp) were ligated via EcoRI adaptors into the Lambda ZAP vector bearing 
the pBK-CMV phagemid. This system has a cloning capacity of 12 kb and 
allows in vivo excision of the cloned fragments. However, the 'Rutgers'- and 
'Basket Pak'-specific cDNA libraries produced by Stratagene had a 
representative size of 1,800 bp and 3.5x10° primary plaques. 
Both libraries were screened three times by plaque hybridization following the 
instruction manual of the Stratagene ZAP EXPRESS™ EcoRI Library with the 
RdRP800 DNA fragment, 32P-labelled using the 'Random Primed DNA Labeling 
Kit' (Boehringer Mannheim). Altogether 23 recombinant plasmid DNAs that had 
been detected with the radioactive probe were excised in vivo from the phages 
and finally introduced into the XLOLR™ E. coli strain according to the 
Stratagene ZAP EXPRESS™ EcoRI Library instruction manual. 
Characterization of the plasmid inserts revealed that none of them 
corresponded to the minimum size of 3,000 bp which could be expected from 
a 120 kDa protein. The largest hybridizing EcoRI cDNA fragment (RdRP24) 
was about 2,300 bp in length, and it contained the entire 3'-part of the RdRP. 
In order to obtain the missing 5'-region of the RdRP cDNA, the 'rapid 
amplification of 5'-cDNA ends' (RACE) was performed. This PCR technique 
allows the amplification of cDNA by using one appropriate gene-specific 
reverse primer (GSP) for which the sequence information about the 5'-end is 
not required. 
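
The expectation that a 120 kDa protein requires a cDNA of at least about 3 kb can be illustrated with a simple estimate. The sketch below assumes an average residue mass of 110 Da, a common rule of thumb that is not taken from the source, so the result is an order-of-magnitude check only.

```python
import math

AVERAGE_RESIDUE_MASS_DA = 110.0  # assumption: typical average amino acid residue mass


def min_coding_length_bp(protein_kda, residue_mass_da=AVERAGE_RESIDUE_MASS_DA):
    """Rough minimum coding length (bp), including a stop codon."""
    residues = math.ceil(protein_kda * 1000.0 / residue_mass_da)
    return (residues + 1) * 3


if __name__ == "__main__":
    needed = min_coding_length_bp(120)
    largest_insert_bp = 2300  # RdRP24, as stated in the text
    print("estimated minimum coding length:", needed, "bp")
    print("largest library insert is long enough:", largest_insert_bp >= needed)
```

The estimate lands slightly above 3 kb, so the 2,300 bp RdRP24 insert cannot cover the whole coding region, which is why a 5'-RACE step was needed.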

Oligo-dT-primed cDNA was synthesized as described (see above) using 
polyA+ RNA that had been isolated from PSTVd-infected tomato plants. 
According to the 'Marathon™ cDNA Amplification Kit' (Clontech), a particular 
adaptor was ligated to both ends of the produced double-stranded cDNA. The 
subsequent PCR amplification was performed with a 1:100 dilution of the 
'adaptored' cDNA (1 µl/reaction) using an oligonucleotide that was 
complementary to the adaptor sequence (adaptor primer, AP) as a forward 
primer. As reverse primers, the following three different GSPs (GSP400, 
GSP420, and GSP1200) complementary to the known RdRP24-specific 
cDNA sequence were designed: 

GSP400:  5'-CAT AAC GAA TCT GGA AAG CAG ATG G-3' (SEQ ID NO:4) 

GSP420:  5'-GAT GAA TCC GGA TCA ACA CCC ACA C-3' (SEQ ID NO:5) 

GSP1200: 5'-GGG TGC TGG AGG ATA TTC CAT CGG C-3' (SEQ ID NO:6) 
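
Because a gene-specific reverse primer must be complementary to the sense strand of the cDNA, its reverse complement should occur in the cDNA sequence. The sketch below checks this for GSP400 against a short excerpt of SEQ ID NO:1; the excerpt is an assumed reading of the printed sequence listing (ambiguous characters resolved with the help of the translated amino acids) and the helper names are illustrative.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (spaces are ignored)."""
    cleaned = seq.upper().replace(" ", "")
    return cleaned.translate(_COMPLEMENT)[::-1]


def anneals_to(primer, sense_strand):
    """True if the reverse primer is fully complementary to the sense strand."""
    return reverse_complement(primer) in sense_strand.upper().replace(" ", "")


GSP400 = "CAT AAC GAA TCT GGA AAG CAG ATG G"
# Assumed reading of the cDNA around the codons Pro-Ser-Ala-Phe-Gln-Ile-Arg-Tyr-Gly.
CDNA_EXCERPT = "CCA TCT GCT TTC CAG ATT CGT TAT GG"

if __name__ == "__main__":
    print(reverse_complement(GSP400))
    print(anneals_to(GSP400, CDNA_EXCERPT))
```

The same check can be applied to the other primers once the corresponding cDNA stretch has been read reliably.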

Thermal cycle parameters, depending on the annealing temperature of the 
GSP, were as follows: 

Program 1 (Tm: 60-65°C) used for GSP400 (Tm=57.9°C) and GSP420 (Tm=62.1°C): 
94°C for 1 min; 30 cycles: 94°C/30 sec, 60°C/30 sec, 68°C/4 min 

Program 2 (Tm: 65-70°C) used for GSP1200 (Tm=86°C): 
94°C for 1 min; 30 cycles: 94°C/30 sec, 68°C/4 min 

In the case of the GSP400-specific RACE reaction, a major product of about 
1,900 bp was amplified. Taking into account a 400 bp long overlap with 
RdRP24, this 1,900 bp fragment contained an RdRP-specific 5'-end sequence 
of about 1,500 bp. In contrast to the AP/GSP400-specific PCR product, 
agarose/EtBr gel analysis of the AP/GSP420-specific PCR products revealed 
that shorter DNA fragments had been amplified. Because GSP420 was 
complementary to the RdRP24 sequence at position +420, the appearance of 
PCR products that were smaller than 1,900 bp in size indicated that 
predominantly inappropriate cDNAs had been synthesized. The reactions 
performed with AP/GSP1200 resulted in fragments of 2,700 bp in length. 
After subtraction of the 1,200 bp which are provided by the overlap between 
GSP1200 and RdRP24, the remaining fragment again indicated that the 
RdRP24 cDNA clone has to be extended by 1,500 bp to give a full-length 
RdRP-specific cDNA of about 3,800 bp in size. 
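
The length bookkeeping behind this conclusion is simple subtraction and can be written out explicitly. The sketch below uses only the approximate sizes quoted in the text; the function name is illustrative.

```python
RDRP24_LENGTH_BP = 2300  # largest library clone, as stated


def novel_five_prime_bp(race_product_bp, overlap_with_rdrp24_bp):
    """Length of new 5' sequence contributed by a RACE product."""
    return race_product_bp - overlap_with_rdrp24_bp


if __name__ == "__main__":
    from_gsp400 = novel_five_prime_bp(1900, 400)
    from_gsp1200 = novel_five_prime_bp(2700, 1200)
    print("GSP400 extension :", from_gsp400, "bp")
    print("GSP1200 extension:", from_gsp1200, "bp")
    print("full-length cDNA :", RDRP24_LENGTH_BP + from_gsp400, "bp")
```

Both independent primers point to the same 1,500 bp extension, which is what makes the 3,800 bp estimate for the full-length cDNA credible.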

Several inserts of different length were obtained after cloning of the gel-purified 
AP/GSP400-specific PCR products into pTPCR (see above). 
Sequence analysis (see above) of the largest RACE clones showed that 
the cloned insert most likely comprised the complete 5'-end of the 
RdRP-specific cDNA. Computer-supported sequence analysis (DNASIS, Pharmacia) 
of a fusion of the AP/GSP400-specific RACE product with the RdRP24 
sequence revealed the entire RdRP cDNA sequence (SEQ ID NO:1) with an 
open reading frame of 1114 amino acids and a calculated molecular weight 
of the RdRP protein of about 127 kDa, which is in good agreement with the 
experimental values that had been determined by SDS-PAGE (128 kDa) and 
sucrose gradient centrifugation (119 kDa) (Schiebel et al., 1993a). 
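
The stated ORF length can be cross-checked against the coding-sequence coordinates given for SEQ ID NO:1 in the sequence listing (CDS location 194..3535, total length 3731 bases). The sketch below performs that arithmetic; whether the annotated interval includes a stop codon is not stated and is left open here.

```python
SEQUENCE_LENGTH_NT = 3731      # from the sequence listing
CDS_START, CDS_END = 194, 3535  # 1-based, inclusive, from the sequence listing


def cds_codons(start, end):
    """Number of complete codons in a 1-based, inclusive CDS interval."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


if __name__ == "__main__":
    print("codons in CDS  :", cds_codons(CDS_START, CDS_END))
    print("5' untranslated:", CDS_START - 1, "nt")
    print("3' untranslated:", SEQUENCE_LENGTH_NT - CDS_END, "nt")
```

The interval spans 1114 codons, matching the open reading frame of 1114 amino acids reported in the text.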

In order to physically generate a full-length cDNA clone, many efforts were 
made to connect the RdRP24 fragment with the AP/GSP400-specific PCR 
product. Although a suitable unique AspI restriction site was found to be 
present in the overlapping area of both subclones, cloning of the gel-purified 
DNA fragments into different vectors and transformation into different E. coli 
strains failed. Because none of these attempts was successful, the PCR 
technique was again utilized to produce the entire RdRP cDNA. 
For this purpose, forward primers had been designed that were specific to 
region 30 (P127BamA) and to region 140 (P127BamB) of the 5'-nontranslated 
region of the RdRP sequence (see above). As a reverse primer, an 
oligonucleotide (P127Bgl) complementary to the 3'-nontranslated region 
(position 3630, see above) was synthesized. 

P127Bam1: 5'-CTT GAG GAG GGA TGG AGT GAT GAG TGG CGT CAA G-3' 
(SEQ ID NO:7) 

P127Bam2: 5'-GGA TAA GTT GAG GGG GGA TGG AGT TGG TGT TAG G-3' 
(SEQ ID NO:8) 

P127BGL:  5'-GGA GGT TGA TGG AGA TGT AAA GAG AAA AGG TAG TG-3' 
(SEQ ID NO:9) 

To enhance the cloning efficiency, these oligonucleotides contained a unique 
restriction site, i.e. a BamHI site in the case of the forward primers, and a BglII 
site in the case of the reverse primer. PCRs were performed with tomato cDNA 
as described above, running thermocycler program 1 and using P127Bgl 
(Tm=65.6°C) in combination with either P127BamA (Tm=72.5°C) or 
P127BamB (Tm=71.1°C). As a result, major products of the expected sizes 
ranging between 3,500 and 3,600 bp were obtained. After double digestion 
with BamHI/BglII, the PCR products were cloned into the plasmid vector 
pT3T7 (Boehringer Mannheim) that had been previously cut with BamHI. 
Sequence analysis of four independent clones revealed that all RdRP-specific 
inserts represented the entire RdRP cDNA. Unfortunately, it turned out that 
they all contained several substitutions when compared with each other, and 
also when compared with the original chimeric sequence. Although most of 
these mutations were found to be "silent", because only the third position of a 
codon triplet was affected, others could lead to an unwanted translational 
stop. The error rate of the EuroTaq polymerase (Eurogentec) is obviously too 
high to amplify 3,500 bp fragments without nucleotide substitutions. 
Therefore, another PCR approach was performed under reaction conditions 
as described above but using the Expand™ High Fidelity PCR System 
(Boehringer Mannheim). This system utilizes a mixture of the Taq and the 
Pwo DNA polymerases; the latter enzyme possesses an efficient 3'-5' 
proofreading ability. RdRP-specific products were again amplified, and after 
sequencing of the corresponding cloned cDNAs it was demonstrated that a 
lower error rate had occurred within the sequences. Finally, one clone 
(RdRP/HF) contained only two "silent" nucleotide substitutions. The mutations 
within the RdRP/HF sequence are presented in figure 3/1. The RdRP/HF 
5'-BamHI site, and the BglII site which is located at the 3'-end of RdRP and 
which is destroyed upon cloning a BglII fragment into a BamHI site, are also 
shown. 



Table 1 

            BamHI                                      (BglII/BamHI) 
            10            1995          2235          3575 
RdRP/HF     GGATCCACTC-//-GATTACTCTC-//-CGACTTGCGG-//-ATAAGAGATC C 

            30            2020          2260          3600 
RdRP cDNA   GGATTGACTC-//-GATTACGCTC-//-CGATTTGCGG-//-ATAAGAAAAC T 

Table 1: Partial alignment of the RdRP/HF and the RdRP cDNA sequences. 
The nucleotide substitutions detected within the coding region of the 
RdRP/HF sequence are underlined. Letters set in italics indicate the 
cloning sites. 

Example 4: Southern analysis of tomato genomic DNA 

Tomato genomic DNA was analysed to determine the copy number and the 
genome organization of the RdRP gene. Knowledge of the RdRP cDNA 
sequence and the computer-supported compilation (DNASIS, Pharmacia) of 
its restriction map allows the detection of gene exons and gene introns by 
Southern hybridization of genomic DNA. 

Genomic DNA was extracted from tomato leaves (20 g) according to the 
procedure of Bedbrook (1981). 15 µg of purified DNA were cut with EcoRI, 
BamHI, HindIII, XbaI, and HaeIII (all enzymes were from Boehringer 
Mannheim), respectively, and electrophoresed on a 0.8% TAE agarose gel 
(Sambrook et al., 1989). After the denaturation and renaturation of the 
agarose gel, the DNA was transferred (Vacuum Blotter, Appligene) to a 
positively charged Nylon Plus membrane (Qiagen), and finally UV312nm-crosslinked 
(0.3 J/cm²). Hybridization of the Southern blot against random 
primed 32P-labeled RdRP24 DNA (see above) was performed as described by 
Amasino (1986). 

The autoradiograph of the Southern blot (exposed for 16 hours) is shown in 
figure 1. The endonucleases EcoRI and BamHI do not cut within the RdRP 
cDNA and most likely there is also no restriction site within the RdRP gene. 
Therefore the single band of about 10,000 bp (lane 1 and lane 2, 
respectively) indicates that the RdRP gene exists as a single copy gene within 
the tomato genome. This assumption is supported by the occurrence of only a 
single hybridizing fragment in lane 4. Because XbaI cuts only once within the 
RdRP cDNA, the second XbaI site must be located in the flanking genomic 
DNA close to the RdRP gene sequence. As a consequence, if the plant 
contained two independent RdRP genes, two fragments should be released by 
XbaI. 

The banding pattern of the hybridizing HindIII fragments clearly demonstrates 
that the RdRP gene is composed of exons and introns. The sizes of the HindIII 
fragments that can be deduced from the restriction map of the RdRP cDNA 
sequence should result in a 130 bp fragment and two border fragments with 
sizes larger than 600 bp. The appearance of a 350 bp fragment (lane 3) can 
only be explained by an additional HindIII site that is located within an intron 
sequence. 

A similar situation is given when the HaeIII digest is analysed. The expected 
size of a HaeIII-specific fragment is about 990 bp. However, there is no signal 
visible on the autoradiograph that corresponds to this size range (lane 6). The 
larger hybridizing fragment of 1,500 bp in size must be due to an intron within 
the 990 bp cDNA fragment. 
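
The comparison rests on predicting fragment sizes from the cDNA restriction map and contrasting them with the genomic bands. A minimal sketch of such a prediction is given below; the toy sequence is hypothetical and is not taken from SEQ ID NO:1, and only the HindIII recognition site (A^AGCTT) is used.

```python
def cut_positions(seq, site, cut_offset):
    """Positions at which a linear sequence is cut for one recognition site."""
    seq, site = seq.upper(), site.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start + cut_offset)
        start = seq.find(site, start + 1)  # also finds overlapping sites
    return positions


def fragment_lengths(seq, site, cut_offset):
    """Fragment lengths after complete digestion of a linear sequence."""
    bounds = [0] + cut_positions(seq, site, cut_offset) + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    # Hypothetical toy sequence with two HindIII sites.
    toy_cdna = "GGG" + "AAGCTT" + "C" * 10 + "AAGCTT" + "TT"
    print(fragment_lengths(toy_cdna, "AAGCTT", cut_offset=1))
```

A genomic band that is larger than the fragment predicted from the cDNA, or an extra band not predicted at all, is the kind of evidence for introns that the text draws on.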

A more detailed analysis of the tomato RdRP gene by characterization of 
clones that had been isolated from two custom ZAP EXPRESS™ EcoRI 
cDNA genomic libraries (Stratagene) is in progress. So far, the entire 
sequence of the coding region could be confirmed, and the existence of at 
least three introns could also be demonstrated (data not shown). 

Example 5: Southern analysis of different plant genomic DNAs 

Of particular interest is the examination of the distribution of the RdRP gene 
over different plant species. Therefore Southern hybridization was performed 
as described above, but now HindIII-restricted genomic DNA of the two 
different tomato cultivars 'Rutgers' and 'St. Pierre', as well as nuclear DNA of 
potato and tobacco was analysed. The autoradiograph of the Southern blot 
(exposed for 16 h: lanes 1, 3, and 4; exposed for 72 h: lane 2) revealed that the 
RdRP gene is detectable in all four genomes (Fig. 2). The strong 
cross-hybridization of the potato DNA with the tomato-specific probe (lane 1) is not 
unexpected because potato is a plant that is closely related to tomato. The 
weak 'tobacco signals' (lane 2) might be due to the fact that tobacco is more 
distantly related to tomato. In addition, the tobacco genome is much more 
complex than that of tomato and potato, respectively. If the same 
amounts of genomic DNA are loaded on the agarose gel, a 
particular DNA sequence, for example that of the RdRP gene, is less well 
represented when the genome is more complex. 
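
The dilution effect described here can be made concrete with a back-of-the-envelope calculation. The sketch below assumes an average mass of about 650 g/mol per base pair and uses rounded, illustrative haploid genome sizes for tomato and tobacco; none of these numbers come from the source, and only their ratio matters for the argument.

```python
AVOGADRO = 6.022e23
AVERAGE_BP_MASS_G_PER_MOL = 650.0  # assumption: typical value for double-stranded DNA

# Illustrative haploid genome sizes in base pairs (assumed, rounded values).
GENOME_SIZE_BP = {
    "tomato": 0.95e9,
    "tobacco": 4.5e9,
}


def single_copy_gene_copies(dna_ug, genome_size_bp):
    """Approximate number of copies of a single-copy gene in a DNA sample."""
    grams = dna_ug * 1e-6
    return grams * AVOGADRO / (genome_size_bp * AVERAGE_BP_MASS_G_PER_MOL)


if __name__ == "__main__":
    for species, size in GENOME_SIZE_BP.items():
        copies = single_copy_gene_copies(15.0, size)
        print(f"{species:8s}: about {copies:.2e} gene copies in 15 µg DNA")
```

Under these assumptions the same mass of tobacco DNA carries several times fewer copies of a single-copy gene than tomato DNA, which is consistent with the weaker signal and the longer exposure needed for the tobacco lane.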

Nevertheless, the existence of a tobacco RdRP gene was also demonstrated 
by the characterization of an 850 bp long PCR product that had been amplified 
from genomic tobacco DNA using tomato RdRP cDNA-specific primers 
(data not shown). 

Example 6: Northern analysis of total tomato RNA 

Total RNA was isolated as described above from PSTVd-infected and from 
viroid-free tomato plants (cv. 'Rutgers' and cv. 'Basket Pak'). Separation of 
total RNAs (15 µg/lane) was performed in phosphate-buffered 1.5% agarose 
gels. The RNAs were pretreated with 1 vol. DMSO-mix in a final volume of 50 
µl and heat-denatured at 85°C for 10 min (Spiesmacher et al., 1985). The 
RNAs were transferred onto non-charged nylon membranes (Qiabrane, 
Qiagen) by capillary blotting as described by Sambrook et al. (1989) and 
hybridized against random primed 32P-labeled RdRP24 DNA (see above). 
The autoradiograph of the Northern analysis, which was exposed for 16 h, 
shows that the RdRP gene is transcribed, and moreover it shows that 
transcription is induced upon PSTVd infection (Fig. 3, compare lanes 1 and 3 
with lanes 2 and 4). This result is in good agreement with the observation that 
the amount of RdRP protein is also increased in viroid-infected tomato plants. 

Example 7: Verification of the RdRP cDNA encoded protein 

In order to examine whether the RdRP cDNA encoded protein (C-protein) is 
identical with the isolated enzyme, four different C-protein-specific antibodies 
had been produced (Eurogentec). From the entire amino acid sequence of the 
C-protein the following peptides had been synthesized and were chosen for 
immunisation of rabbits (Eurogentec): 

P430: SNRVLRNYSEDIDN (SEQ ID NO:10) 

P431: ASKTFDRRKDAEAI (SEQ ID NO:11) 

P432: EQYDGYLKGRQPPKSPS (SEQ ID NO:12) 

P433: VFPQKGKRPHNEC (SEQ ID NO:13) 

The specific reaction of each antiserum with the RdRP protein isolated from 
the tomato leaf tissue was tested by Western blotting (Sambrook et al., 1989). 
The four antisera were able to immunologically detect a protein at the position 
of the RdRP activity (data not shown). Subsequently the two most sensitive 
antisera were applied onto a peptide/antigen matrix column for affinity 
purification (Eurogentec). The purified P431 antiserum (Ap431) was used in all 
experiments described below. 

Specific immunological detection of the active RdRP protein with Ap431 was 
analysed with elution fractions from four different chromatographic runs. 1 µl 
aliquots of those fractions that contained the highest RdRP activity were 
applied onto SDS-PAGE (Phast system, Pharmacia) and were then subjected 
to Western analysis (Fig. 4). As shown in figure 4, the intensity of the 
Ap431-specific signal correlates with the enzyme activity in that only those fractions 
reacted with Ap431 that also contained active RdRP. As a control, the protein 
sample that was run in lane 1 originates from the resuspended '30,000 x g 
precipitate' (see above) that was loaded on the Q Sepharose FF column. The 
additional signal that is visible in lanes 2 and 3 was demonstrated to be due to 
reaction of a pre-serum-specific antibody with a protein that is eluted from the 
Q-Sepharose FF column together with the RdRP protein. 
Additional evidence supporting the assertion that the RdRP cDNA encodes the RdRP 
enzyme came from a gel filtration experiment (Fig. 5). The Sephadex G-200 
gel system is used to separate proteins according to their molecular weight. 
This size fractionation procedure was originally conceived to further purify the 
plant RdRP protein. Surprisingly, the retention volume for the elution 
of the tomato enzyme corresponded to a protein with a size of less than 65 
kDa. As shown in figure 4, the highest RdRP activity was found in fraction 15 
and not in fraction 13 as expected. But with the aid of SDS-PAGE analysis it 
was demonstrated that fraction 15 contained a 128 kDa protein and moreover 
this protein was detected by Ap431 (Fig. 4, immunoblot, lane 15). 

The present invention is not to be limited in scope by the specific 
embodiments described which are intended as single illustrations of individual 
aspects of the invention, and any nucleic acid molecules, proteins, constructs 
or antibodies which are functionally equivalent are within the scope of this 
invention. Indeed, various modifications of the invention in addition to those 
shown and described herein will become apparent to those skilled in the art 
from the foregoing description and accompanying drawings. Such 
modifications are intended to fall within the scope of the appended claims. 

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

  (i) APPLICANT: 
    (A) NAME: Max-Planck-Gesellschaft zur Foerderung der Wissenschaften e.V. 
    (B) STREET: none 
    (C) CITY: Berlin 
    (E) COUNTRY: DE 
    (F) POSTAL CODE (ZIP): none 

  (ii) TITLE OF INVENTION: Nucleic acid molecules encoding an 
       RNA-directed RNA polymerase (RdRP) 

  (iii) NUMBER OF SEQUENCES: 2 

  (iv) COMPUTER READABLE FORM: 
    (A) MEDIUM TYPE: Floppy disk 
    (B) COMPUTER: IBM PC compatible 
    (C) OPERATING SYSTEM: PC-DOS/MS-DOS 
    (D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

  (i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 3731 base pairs 
    (B) TYPE: nucleic acid 
    (C) STRANDEDNESS: single 
    (D) TOPOLOGY: linear 

  (ii) MOLECULE TYPE: cDNA 

  (vi) ORIGINAL SOURCE: 
    (A) ORGANISM: Tomato 

  (ix) FEATURE: 
    (A) NAME/KEY: CDS 
    (B) LOCATION: 194..3535 

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GAAATATTCI TTAdTACIT CACCAGGGAT TGACTCATCA CTCCCCTCAA QTCTTTGTQT 
60 

GTTQTGATAA TAAATTTQGT TQTQCTTCAQ TTTCAQTCAC TACTOCTGGG TAGTmTAT 
120 

TTTQCATAAC TTCAGGGGGT ATTCCAGTTG GTGTTAGCAT TTGAAAGTCO 
AACTGCACTT 180 

QQAATTTGGC TAC ATQ QGA AAG ACA ATT CAG GTT TTC GGA TTC CCT TAT 229 
Met Gly Lys Thi He Gla Val Phe Gly Phe Pro Tyr 
1 5 10 
CTTCTCTCraCQQAAGTGGTTAAOTCATTCTTAGAGAAATATACAG^ 277 
Leu Leu Ser Ala Ghi Val Val Lya Ser Phc Leu Glu Lys Tyr Thr Oly 
15 20 25 

TAT GGA ACT GTA TOT GCA TTQ GAG GTT AAA CAG TCC AAA GQA QQA TCT 325 
Tyr Gly Thr Val Cys Ala Leu Ghi Val Lys Gin Sor Ly» Gly Qly Scr 
30 35 40 

AGA GCA TIT QCC AAA GTT CAA TTT GCC GAC AAC ATA AGT GCT GAG AAA 373 
Arg Ala Phe Ala Lya Val Gin Pho Ala Asp Aan lie Ser Ala Asp Lya 
45 50 55 60 

ATC ATC ACT TTG GCT AAT AAC AQG CTG TAT TTT QGC TCT TCT TAT TTQ 42 1 
lie n© Thr Leu Ala Asa Aan Aig Leu Tyr Phe Gly Ser Sor Tyr L«fU 
65 70 75 

AAQ GCT TGG GAA ATG AAA ACT GAT ATT QTC CAA CTG CGG GCA TAT QTQ 469 
Lys Ala Trp Glu Mot Lys Thr Asp lie Val Gin Leu Arg Ala Tyr Val 
SO 85 90 

GAT CAG ATG GAT GGC ATA ACT TTQ AAT TTC GGA TGT CAQ ATA TCA GAT 5 1 7 
Asp Gin Met Asp Qly lie Thr Leu Ain Pho Gly Cys Gin De Scr Asp 
95 100 105 

GAC AAG TTT GCA GTG TTQ GQA AQT ACA GAA GTT TCA ATT CAA TTT GGC 565 
Asp Lys Phc Ala Val Lou Gly Sex Thr Ghi Val Scr He Gin Pho Gly 
110 115 120 

ATT QQA TTQ AAQ AAA TTT TTT TTC TTT TTA TCT AQT QQT TCA GCT GAC 613 
He Qly Lou Lys Ly» Pho Phc Phe Phc Leu Scr Scr Gly Scr Ala Asp 
125 130 135 140 

TAT AAA CTT CAQ CTT TCA TAT GAA AAT ATA TOG CAG GTT GTG CTC CAT 66 1 
Tyr Lys Leu Ghi Leu Ser Tyr Ghi Aan Do Trp Gin Val Val Leu Hia 
145 150 155 

CGT CCA TAT QQT CAA AAT GCT CAG TTT CTC CTC ATA CAG TTA TIT GGT 709 
Axg Pro Tyr Gly Ghi Ain Ala Gin Phc Lou Lou lie Ghi Leu Phe Gly 
160 165 170 

OCT CCT CGG ATC TAT AAG AGA CTT GAA AAC TCC TGT TAT AGC TTC TTT 757 
Ala Pro Arg De Tyr Lys Arg Leu Qhi Aan Ser Cys Tyr Ser Phc Phe 
175 180 185 

AAQ GAA ACT CCT GAT GAT CAG TGG QTQ AGO ACA ACA GAT TTC CCT CCA 805 
Lys Ghi Thr Pro Asp Asp Gin Trp Val Arg Thr Thx Asp Phe Pro Pro 
190 195 200 

TCT TGG ATA QQQ CTA TCT TCT AGC TTA TGT TTG CAQ TTC CQT AGG GGT 853 
Ser Trp He Gly Leu Ser Ser Ser Leu Cys Leu Qln Pho Arg Arg Gly 
205 210 215 220 

GTT CQT CTT CCA AAT TTC GAG GAA AGT TIT TTC CAC TAT OCA GAA CQT 90 1 
Val Axg Leu Pro Aan Phc Ghi Ghx Ser Pho Phe His Tyr Ala Ghi Arg 
225 230 235 

GAA AAC AAT ATT ACT TTA CAG ACT GOT TTC ACC TTT TTC QTC TCT CAA 949 
Glu Asn Asn Do Thr Leu Gla Tbx Gly Phc Thr Phe Pho Val Ser Gin 
240 245 250 

AAA TCQ OCT CTG GTT CCC AAT GTC CAG CCT CCG QAA GGA ATT TCA ATT 997 
Lys Ser Ala Leu Val Pro Aan Val Gin Pro Pro Ghi Qly He Scr He 
255 - - 260 265 

CCC TAG AAQ ATT TTG TTC AAA ATT AGT TCT TTG GTA CAG CAT GGA TGC 1 045 
Pro Tyr Lya lie Leu Phe Ly§ Do Ser Ser Leu Val Gin Hi* Gly Cys 
270 275 280 

ATA CCT OGG CCA GCA TTA AAT GTC TAC TTT TTC CGA TTA GTT GAT CCT 1 093 
He Pro Gly Pro Ala Leu Asn Val Tyr Phe Phe Arg Leu Val Asp Pro 
285 290 295 300 

CGA AGG AQA AAT GTG GCA TQC ATT GAG CAT QCC TTA GAG AAA CTG TAC 1 141 
Arg Axg Arg Aan Val Ala Cyt lie Glu Hia Ala Leu Ghi Lya Leu Tyr 
305 310 315 

TAT ATA AAG GAG TGC TOT TAT GAT CCC GTG AGG TGG CTC ACT GAG CAG 1 189 
Tyr nc Lys Ghi Cys Cys Tyr Asp Pro Val Arg Trp Leu Thr Glu Gin 
320 325 330 

TAT GAT GOG TAT CTC AAG GGT AGA CAA CCT CCA AAA TCT CCG TCC ATC 1 237 
Tyr Aap Gly Tyr Leu Lya Qly Arg Gin Pro Pro Lys Ser Pro Ser He 
335 340 345 

ACT TTA GAT GAT OGG TTG GTG TAT GTA AGA AGG GTC CTA GTA ACA CCA 1285 
Thr Lou Asp Asp Gly Leu Val Tyr Val Arg Arg Val Leu Val Thr Pro 
350 355 360 

TQC AAA GTT TAT TTT TGT GGT CCA GAG GTT AAT GTT TCC AAT CGG GTT 1333 
Cys Lys Val Tyr Phe Cya Gly Pro Glu Val Asn Val Ser Aan Arg Val 
365 370 375 380 

CTC CGC AAT TAT TCT QAA GAC ATA GAT AAC TTT CTT CGT GTT TCT TTT 1381 
Leu Arg Asn Tyr Ser Glu Asp lie Asp Aan Phe Leu Arg Val Ser Phe 
385 390 395 

GTT GAT GAG GAG TGG GAQ AAA CTG TAT TCT ACA QAC TTA TTA CCA AAA 1 429 
Val Asp Ghx Glu Trp Glu Lys Leu Tyr Ser Thr Asp Leu Leu Pro Lya 
400 405 410 

GCA AGT ACT GGA AGT GGT GTC AGG ACA AAC ATC TAT GAQ AGG ATC TTA 1477 
Ala Ser Thr Gly Ser Gly Val Arg Thr Asa He Tyr Ghi Arg He Leu 
415 420 425 

TCA ACT CTG CGG AAA GGC TTT GTA ATT GGT GAT AAA AAA TTT GAA TTT 1 525 
Scr Hit Leu Arg Lys Qly Phe Val He Gly Asp Lys Lys Phe Glu Phe 
430 435 440 

CTT GCA TTT TCA TCQ AQC CAG TTG CGG GAT AAT TCA GTG TGG ATG TIT 1573 
Leu Ala Phe Ser Ser Ser Ghi Leu Arg Asp Asn Ser Val Trp Met Phe 
445 450 455 460 

GCA TCA AQA CCT QQC CTT ACT GCA AAT GAT ATA AGA GCT TGG ATQ GGT 1621 
Ala Ser Arg Pro Qly Leu Thr Ala Asn Asp ne Arg Ala Trp Met Gly 
465 470 475 
GAT TTT TCG CAG ATC AAG AAT GTC GCA AAA TAT GCT GCC AGA CTT GGT 1669 
Asp Phe Ser Gln Ile Lys Asn Val Ala Lys Tyr Ala Ala Arg Leu Gly 
480 485 490 

CAA TCT TTT GGT TCC TCC AGA GAG ACT TTG AGT QTT CTT AGG CAT GAG 1717 
Gin Scr Phe Qly Ser Set Arg Ghj Thr Leu Ser Val Leu Arg His Ghi 
495 500 505 

ATT GAA QTT ATT CCC GAT QTA AAG GTT CAT QGA ACC AGC TAT GTC TTT 1765 
Dc Ghi Val He Pro Asp Val Lys Val Hia Qly Thr Ser Tyr Val Pho 
510 515 520 

TCT QAT QGA ATT GGT AAA ATA TCT GGT QAC TTT GCT CAT AQA GTT GCC 1813 
Ser Asp Gly De Qly Lys He Ser Gly Asp Phc Ala His Aig Val Ala 
525 530 535 540 

TCA AAA TGT QQC CTT CAA TAT ACC CCA TCT GCT TTC CAG ATT CGT TAT 1 86 1 
Ser Lys Cys Gly Leu Gin Tyr Thr Pro Ser Ala Phe Gin ne Axg Tyr 
545 550 555 

QQT GGA TAT AAA QQT QTT GTG GGT GTT QAT CCG GAT TCA TCA ATQ AAG 1909 
Gly Gly Tyr Lys Qly Val Val Gly Val Asp Pro Asp Ser Ser Met Lys 
560 565 570 

TTG TCT TTG AQA AAQ AGC ATG TCG AAA TAT GAA TCA QAC AAC ATA AAG 1 957 
Leu Ser Leu Arg Lys Ser Met Ser Lys Tyr Glu Sex Asp Aan lie Lys 
575 580 585 

TTA QAT GTC CTT GGA TGG AQC AAA TAT CAG CCT TGT TAT CTT AAT CGT 2005 
Leu Asp Val Leu Qly Trp Ser Lys Tyt Gin Pro Cys Tyr Leu Asn Arg 
590 595 600 

CAA CTG ATT ACG CTC TTG TCT ACA CTT GGA GTG AAA GAT GAA GTT CTC 2053
Gln Leu Ile Thr Leu Leu Ser Thr Leu Gly Val Lys Asp Glu Val Leu
605 610 615 620

GAA CAG AAG CAA AAG GAA GCT GTA GAT CAG CTT GAT GCT ATC TTG CAT 2101
Glu Gln Lys Gln Lys Glu Ala Val Asp Gln Leu Asp Ala Ile Leu His
625 630 635

GAT TCT TTG AAG GCA CAG GAG GCT TTG GAA TTG ATG TCT CCT GGA GAG 2149
Asp Ser Leu Lys Ala Gln Glu Ala Leu Glu Leu Met Ser Pro Gly Glu
640 645 650

AAC ACT AAT ATT CTC AAG GCA ATG CTA AAC TGT GGT TAT AAG CCT GAT 2197
Asn Thr Asn Ile Leu Lys Ala Met Leu Asn Cys Gly Tyr Lys Pro Asp
655 660 665

GCT GAG CCC TTT CTT TCA ATG ATG TTG CAA ACC TTC CGC GCA TCC AAG 2245
Ala Glu Pro Phe Leu Ser Met Met Leu Gln Thr Phe Arg Ala Ser Lys
670 675 680

TTG CTC GAT TTG CGG ACT AGA TCA AGA ATA TTT ATT CCA AAT GGA AGA 2293
Leu Leu Asp Leu Arg Thr Arg Ser Arg Ile Phe Ile Pro Asn Gly Arg
685 690 695 700

ACA ATG ATG GGA TGT TTG GAT GAA TCC AGA ACC TTG GAA TAT GGT CAG 2341
Thr Met Met Gly Cys Leu Asp Glu Ser Arg Thr Leu Glu Tyr Gly Gln
705 710 715

GTG TTT GTT CAG TTT ACT GGT GCT GGA CAT GGA GAG TTT TCT GAC GAT 2389
Val Phe Val Gln Phe Thr Gly Ala Gly His Gly Glu Phe Ser Asp Asp
720 725 730

TTA CAT CCA TTT AAT AAC AGC AGA TCC ACC AAC AGT AAT TTC ATT CTG 2437
Leu His Pro Phe Asn Asn Ser Arg Ser Thr Asn Ser Asn Phe Ile Leu
735 740 745

AAG GGA AAT GTG GTT GTT GCA AAA AAT CCA TGC TTG CAT CCT GGT GAT 2485
Lys Gly Asn Val Val Val Ala Lys Asn Pro Cys Leu His Pro Gly Asp
750 755 760

ATT CGT GTT TTA AAG GCT GTA AAT GTT CGA GCG CTG CAC CAC ATG GTA 2533
Ile Arg Val Leu Lys Ala Val Asn Val Arg Ala Leu His His Met Val
765 770 775 780

GAT TGT GTT GTA TTC CCT CAG AAA GGA AAA AGA CCT CAT CCG AAT GAA 2581
Asp Cys Val Val Phe Pro Gln Lys Gly Lys Arg Pro His Pro Asn Glu
785 790 795

TGT TCT GGG AGT GAT TTG GAT GGG GAT ATC TAC TTT GTT TGC TGG GAT 2629
Cys Ser Gly Ser Asp Leu Asp Gly Asp Ile Tyr Phe Val Cys Trp Asp
800 805 810

CAA GAC ATG ATC CCG CCA AGG CAA GTC CAG CCG ATG GAA TAT CCT CCA 2677
Gln Asp Met Ile Pro Pro Arg Gln Val Gln Pro Met Glu Tyr Pro Pro
815 820 825

GCA CCC AGC ATA CAG TTG GAC CAT GAT GTC ACA ATT GAG GAA GTT GAA 2725
Ala Pro Ser Ile Gln Leu Asp His Asp Val Thr Ile Glu Glu Val Glu
830 835 840

GAG TAC TTC ACC AAC TAT ATT GTG AAT GAC AGT TTG GGA ATC ATA GCA 2773
Glu Tyr Phe Thr Asn Tyr Ile Val Asn Asp Ser Leu Gly Ile Ile Ala
845 850 855 860

AAT GCC CAT GTC GTA TTT GCA GAC AGA GAA CCT GAT ATG GCC ATG AGT 2821
Asn Ala His Val Val Phe Ala Asp Arg Glu Pro Asp Met Ala Met Ser
865 870 875

GAT CCA TGC AAA AAA CTT GCT GAG CTC TTT TCA ATT GCA GIXj GAC TTT 2869
Asp Pro Cys Lys Lys Leu Ala Glu Leu Phe Ser Ile Ala Val Asp Phe
880 885 890

CCA AAG ACT GGT GTT CCC GCT GAA ATA CCA TCT CAG TXXj CGC CCT AAA 2917
Pro Lys Thr Gly Val Pro Ala Glu Ile Pro Ser Gln Leu Arg Pro Lys
895 900 905

GAA TAC CCA GAC TTC ATG GAT AAG CCG GAC AAG ACC AGC TAT ATC TCA 2965
Glu Tyr Pro Asp Phe Met Asp Lys Pro Asp Lys Thr Ser Tyr Ile Ser
910 915 920

GAA AGA GTT ATT GGA AAG CTT TTC AGG AAA GTG AAG GAC AAA GCA CCT 3013
Glu Arg Val Ile Gly Lys Leu Phe Arg Lys Val Lys Asp Lys Ala Pro
925 930 935 940

CAG GCT AGC TCT ATC GCG ACC TTC ACA AGA GAT GTT GCA AGG AGA TCA 3061
Gln Ala Ser Ser Ile Ala Thr Phe Thr Arg Asp Val Ala Arg Arg Ser
945 950 955

TAT GAT GCT GAT ATG GAA GTT GAT GGA TTT GAA GAT TAC ATT GAC GAA 3109
Tyr Asp Ala Asp Met Glu Val Asp Gly Phe Glu Asp Tyr Ile Asp Glu
960 965 970

GCT TTT GAC TAC AAA ACT GAA TAT GAC AAC AAG CTG GGT AAT TTA ATG 3157
Ala Phe Asp Tyr Lys Thr Glu Tyr Asp Asn Lys Leu Gly Asn Leu Met
975 980 985

GAC TAC TAT GGC ATA AAA ACA GAG GCT GAA ATA CTT AGT GGT GGC ATT 3205
Asp Tyr Tyr Gly Ile Lys Thr Glu Ala Glu Ile Leu Ser Gly Gly Ile
990 995 1000

ATG AAG GCA TCA AAA ACT TTT GAC CGC AGA AAA GAT GCT GAG GCC ATT 3253
Met Lys Ala Ser Lys Thr Phe Asp Arg Arg Lys Asp Ala Glu Ala Ile
1005 1010 1015 1020

AGT GTT GCT GTG AGG GCC TTG AGG AAG GAG GCA AGA GCC TGG TTC AAG 3301
Ser Val Ala Val Arg Ala Leu Arg Lys Glu Ala Arg Ala Trp Phe Lys
1025 1030 1035

AGG CGT AAT GAT ATA GAT GAC ATG TTA CCA AAG GCT TCG GCT TGG TAC 3349
Arg Arg Asn Asp Ile Asp Asp Met Leu Pro Lys Ala Ser Ala Trp Tyr
1040 1045 1050

CAC GTT ACA TAT CAT CCT ACA TAT TGG GGT TGC TAC AAT CAG GGG TTG 3397
His Val Thr Tyr His Pro Thr Tyr Trp Gly Cys Tyr Asn Gln Gly Leu
1055 1060 1065

AAA AGA GCT CAT TTC ATT AGC TTT CCC TGG TGT GTT TAT GAC CAG CTA 3445
Lys Arg Ala His Phe Ile Ser Phe Pro Trp Cys Val Tyr Asp Gln Leu
1070 1075 1080

ATC CAG ATT AAG AAG GAC AAA GCA CGT AAC AGG CCA GTT CTC AAC TTG 3493
Ile Gln Ile Lys Lys Asp Lys Ala Arg Asn Arg Pro Val Leu Asn Leu
1085 1090 1095 1100

TCA TCT CTC AGG GCT CAA CTG AGT CAC AGA TTA GTG TTG AAA 3535
Ser Ser Leu Arg Ala Gln Leu Ser His Arg Leu Val Leu Lys
1105 1110

TGAGATTCCA GTCGAGCGTT AAGCTGATAT ATATATAATG TAATACGGTG TGATCATAAG 3595

AAAACTGTTA TGCATTGTTG ACTACCTTTT GTCTTTAAAA CTGCATGAAG CTGCAACATA 3655

TATGCAGTAC TCTAAGAAAC AGATGTACAG CTAAOTACTA ATATGTATGT GATTTGAGTT 3715

TCATCTTTCT TCTAAA 3731



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1114 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Lys Thr Ile Gln Val Phe Gly Phe Pro Tyr Leu Leu Ser Ala
1 5 10 15

Ghi Val Val Lys Ser Phe Leu Glu Lys Tyr Thr Gly Tyr Gly Thr Val
20 25 30

Cys Ala Leu Glu Val Lys Gln Ser Lys Gly Gly Ser Arg Ala Phe Ala
35 40 45

Lys Val Ghi Phe Ala Asp Asn Ile Ser Ala Asp Lys Ile Ile Thr Leu
50 55 60

Ala Aki Asn Arg Leu Tyr Phe Gly Ser Ser Tyr Leu Lys Ala Trp Ghi
65 70 75 80

Met Lys Thr Asp Ile Val Gln Leu Arg Ala Tyr Val Asp Ghx Met Asp
85 90 95

Gly Ile Thr Leu Asn Phe Gly Cys Ghi Ile Ser Asp Asp Lys Phe Ala
100 105 110

Val Leu Gly Ser Thr Glu Val Ser Ile Ghi Phe Gly Ile Gly Leu Lys
115 120 125

Lys Phe Phe Phe Phe Leu Ser Ser Gly Ser Ala Asp Tyr Lys Leu Gln
130 135 140

Leu Ser Tyr Ghi Asn Ile Trp Qhi Val Val Leu His Arg Pro Tyr Gly
145 150 155 160

Gln Asn Ala Ghi Phe Leu Leu Ile Gk Leu Phe Gly Ala Pro Arg Ile
165 170 175

Tyr Lys Arg Leu Glu Asn Ser Cys Tyr Ser Phe Phe Lys Glu Thr Pro
180 185 190

Asp Asp Ghi Trp Val Arg Thr Thr Asp Phe Pro Pro Ser Trp Ile Gly
195 200 205

Leu Ser Ser Ser Leu Cys Leu Ghi Phe Arg Arg Gly Val Arg Leu Pro
210 215 220

Asn Phe Glu Qhi Ser Phe Phe His Tyr Ala Glu Arg Glu Asn Asn Ile
225 230 235 240

Thr Leu Ghi Thr Gly Phe Thr Phe Phe Val Ser Ghi Lys Ser Ala Leu
245 250 255

Val Pro Asn Val Ghi Pro Pro Glu Gly Ile Ser Ile Pro Tyr Lys Ile
260 265 270

Leu Phe Lys Ile Ser Ser Leu Val Gln His Gly Cys Ile Pro Gly Pro
275 280 285

Ala Leu Asn Val Tyr Phe Phe Arg Leu Val Asp Pro Arg Arg Arg Asn
290 295 300

Val Ala Cys Ile Glu His Ala Leu Glu Lys Leu Tyr Tyr Ile Lys Ghi
305 310 315 320

Cys Cys Tyr Asp Pro Val Arg Trp Leu Thr Qhx Gln Tyr Asp Gly Tyr
325 330 335

Leu Lys Gly Arg Glu Pro Pro Lys Ser Pro Ser Ile Thr Leu Asp Asp
340 345 350

Gly Leu Val Tyr Val Arg Arg Val Leu Val Thr Pro Cys Lys Val Tyr 

355 360 365 

Phe Cys Gly Pro Glu Val Asn Val Ser Asn Arg Val Leu Arg Asn Tyr
370 375 380

Ser Glu Asp Ile Asp Asn Phe Leu Arg Val Ser Phe Val Asp Glu Ghi
385 390 395 400

Trp Glu Lys Leu Tyr Ser Thr Asp Leu Leu Pro Lys Ala Ser Thr Gly
405 410 415

Ser Gly Val Arg Thr Asn Ile Tyr Ghi Arg Ile Leu Ser Thr Leu Arg
420 425 430

Lys Gly Phe Val Ile Gly Asp Lys Lys Phe Ghi Phe Leu Ala Phe Ser
435 440 445

Ser Ser Ghi Leu Arg Asp Asn Ser Val Trp Met Phe Ala Ser Arg Pro
450 455 460

Gly Leu Thr Ala Asn Asp Ile Arg Ala Trp Met Gly Asp Phe Ser Gln
465 470 475 480

Ile Lys Asn Val Ala Lys Tyr Ala Ala Arg Leu Gly Gln Ser Phe Gly
485 490 495

Ser Ser Arg Glu Thr Leu Ser Val Leu Arg His Glu Ile Glu Val Ile
500 505 510

Pro Asp Val Lys Val His Gly Thr Ser Tyr Val Phe Ser Asp Gly Ile
515 520 525

Gly Lys Ile Ser Gly Asp Phe Ala His Arg Val Ala Ser Lys Cys Gly
530 535 540

Leu Gln Tyr Thr Pro Ser Ala Phe Gln Ile Arg Tyr Gly Gly Tyr Lys
545 550 555 560

Gly Val Val Gly Val Asp Pro Asp Ser Ser Met Lys Leu Ser Leu Arg
565 570 575

Lys Ser Met Ser Lys Tyr Glu Ser Asp Asn Ile Lys Leu Asp Val Leu
580 585 590

Gly Trp Ser Lys Tyr Gln Pro Cys Tyr Leu Asn Arg Gln Leu Ile Thr
595 600 605

Leu Leu Ser Thr Leu Gly Val Lys Asp Glu Val Leu Glu Gln Lys Gln
610 615 620

Lys Glu Ala Val Asp Gln Leu Asp Ala Ile Leu His Asp Ser Leu Lys
625 630 635 640

Ala Gln Glu Ala Leu Glu Leu Met Ser Pro Gly Glu Asn Thr Asn Ile
645 650 655

Leu Lys Ala Met Leu Asn Cys Gly Tyr Lys Pro Asp Ala Glu Pro Phe
660 665 670

Leu Ser Met Met Leu Gln Thr Phe Arg Ala Ser Lys Leu Leu Asp Leu
675 680 685

Arg Thr Arg Ser Arg Ile Phe Ile Pro Asn Gly Arg Thr Met Met Gly
690 695 700

Cys Leu Asp Glu Ser Arg Thr Leu Glu Tyr Gly Gln Val Phe Val Gln
705 710 715 720

Phe Thr Gly Ala Gly His Gly Glu Phe Ser Asp Asp Leu His Pro Phe
725 730 735

Asn Asn Ser Arg Ser Thr Asn Ser Asn Phe Ile Leu Lys Gly Asn Val
740 745 750

Val Val Ala Lys Asn Pro Cys Leu His Pro Gly Asp Ile Arg Val Leu
755 760 765

Lys Ala Val Asn Val Arg Ala Leu His His Met Val Asp Cys Val Val 
770 775 780 

Phe Pro Gln Lys Gly Lys Arg Pro His Pro Asn Glu Cys Ser Gly Ser
785 790 795 800

Asp Leu Asp Gly Asp Ile Tyr Phe Val Cys Trp Asp Gln Asp Met Ile
805 810 815

Pro Pro Arg Gln Val Gln Pro Met Glu Tyr Pro Pro Ala Pro Ser Ile
820 825 830

Gln Leu Asp His Asp Val Thr Ile Glu Glu Val Glu Glu Tyr Phe Thr
835 840 845

Asn Tyr Ile Val Asn Asp Ser Leu Gly Ile Ile Ala Asn Ala His Val
850 855 860

Val Phe Ala Asp Arg Glu Pro Asp Met Ala Met Ser Asp Pro Cys Lys 
865 870 875 880 

Lys Leu Ala Glu Leu Phe Ser Ile Ala Val Asp Phe Pro Lys Thr Gly
885 890 895

Val Pro Ala Glu Ile Pro Ser Gln Leu Arg Pro Lys Glu Tyr Pro Asp
900 905 910

Phe Met Asp Lys Pro Asp Lys Thr Ser Tyr Ile Ser Glu Arg Val Ile
915 920 925

Gly Lys Leu Phe Arg Lys Val Lys Asp Lys Ala Pro Gln Ala Ser Ser
930 935 940

Ile Ala Thr Phe Thr Arg Asp Val Ala Arg Arg Ser Tyr Asp Ala Asp
945 950 955 960

Met Glu Val Asp Gly Phe Glu Asp Tyr Ile Asp Glu Ala Phe Asp Tyr
965 970 975

Lys Thr Glu Tyr Asp Asn Lys Leu Gly Asn Leu Met Asp Tyr Tyr Gly
980 985 990

Ile Lys Thr Glu Ala Glu Ile Leu Ser Gly Gly Ile Met Lys Ala Ser
995 1000 1005

Lys Thr Phe Asp Arg Arg Lys Asp Ala Glu Ala Ile Ser Val Ala Val
1010 1015 1020

Arg Ala Leu Arg Lys Glu Ala Arg Ala Trp Phe Lys Arg Arg Asn Asp
1025 1030 1035 1040

Ile Asp Asp Met Leu Pro Lys Ala Ser Ala Trp Tyr His Val Thr Tyr
1045 1050 1055

His Pro Thr Tyr Trp Gly Cys Tyr Asn Gln Gly Leu Lys Arg Ala His
1060 1065 1070

Phe Ile Ser Phe Pro Trp Cys Val Tyr Asp Gln Leu Ile Gln Ile Lys
1075 1080 1085

Lys Asp Lys Ala Arg Asn Arg Pro Val Leu Asn Leu Ser Ser Leu Arg
1090 1095 1100

Ala Gln Leu Ser His Arg Leu Val Leu Lys
1105 1110



